Stepping into the Future with GPT-4.5
A cornerstone of the advancements within Azure AI Foundry is the introduction of GPT-4.5, currently available in preview on Azure OpenAI Service. This latest iteration builds on the success of its predecessors and is positioned as the most capable general-purpose model in the lineup to date. Its development marks a major stride in unsupervised learning, achieved by scaling both pre- and post-training.
GPT-4.5 elevates the user experience with more natural interactions. Its expanded knowledge base and enhanced ‘EQ’ contribute to improved performance in coding, writing, and problem-solving tasks. The model’s capabilities are evident in several key areas:
Enhanced Accuracy: Developers can rely on GPT-4.5 for more precise and relevant responses. On OpenAI’s SimpleQA benchmark it shows a significantly lower hallucination rate (37.1% vs. 61.8%) and higher accuracy (62.5% vs. 38.2%) than GPT-4o. Fewer errors and more reliable outputs are crucial for applications requiring high precision.
Improved Human Alignment: Refined alignment techniques enable GPT-4.5 to better follow instructions, understand nuances, and engage in natural conversations. This makes it a more effective tool for tasks like coding and project management, where understanding context and intent is paramount. The model’s ability to grasp subtle cues and respond appropriately enhances its usability and effectiveness in collaborative scenarios.
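To ground the capabilities above, a chat request to a GPT-4.5 deployment on Azure OpenAI Service can be sketched with nothing but the standard library. The endpoint, deployment name, and API version below are placeholders, not real values; in practice you would more likely use the official `openai` SDK’s `AzureOpenAI` client, and the API key is supplied via the `api-key` header.

```python
import json
import urllib.request

# Hypothetical values -- replace with your own resource endpoint and deployment.
ENDPOINT = "https://my-resource.openai.azure.com"
DEPLOYMENT = "gpt-45-preview"
API_VERSION = "2024-10-21"  # illustrative; check the service docs for current versions

def build_chat_payload(system_prompt: str, user_prompt: str) -> dict:
    """Assemble the JSON body for a chat completions request."""
    return {
        "messages": [
            {"role": "system", "content": system_prompt},
            {"role": "user", "content": user_prompt},
        ],
        "temperature": 0.7,
    }

def send_chat_request(payload: dict, api_key: str) -> dict:
    """Send the request; not called here because it needs a live resource and key."""
    url = (f"{ENDPOINT}/openai/deployments/{DEPLOYMENT}"
           f"/chat/completions?api-version={API_VERSION}")
    req = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json", "api-key": api_key},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)

payload = build_chat_payload(
    "You are a concise coding assistant.",
    "Explain list comprehensions in one sentence.",
)
print(json.dumps(payload, indent=2))
```

The same payload shape works for any chat-capable deployment in the resource; only the deployment segment of the URL changes.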
The versatility of GPT-4.5 opens doors to a wide range of applications, boosting both productivity and creativity across various domains:
Communication Enhancement: Users can leverage GPT-4.5 to draft clear and effective emails, messages, and documentation. The model’s improved language understanding and generation capabilities ensure that communications are concise, coherent, and tailored to the intended audience.
Personalized Learning: The model facilitates personalized learning and coaching experiences, aiding users in acquiring new skills or deepening their knowledge in specific areas. GPT-4.5 can adapt to individual learning styles and paces, providing customized feedback and guidance.
Creative Ideation: During brainstorming sessions, GPT-4.5 serves as a valuable tool for generating innovative ideas and solutions. Its ability to connect seemingly disparate concepts and explore diverse perspectives can spark creativity and lead to novel approaches.
Project Management Assistance: GPT-4.5 assists in organizing tasks, ensuring thorough and efficient approaches to project planning and execution. It can help break down complex projects into manageable steps, track progress, and identify potential roadblocks.
Complex Task Automation: The model simplifies intricate processes and workflows by handling complex task automation. This frees up human workers to focus on higher-level tasks that require critical thinking and creativity.
Streamlined Coding Workflows: Developers can benefit from step-by-step guidance and automation of repetitive tasks, saving time and reducing errors. GPT-4.5 can assist with code completion, debugging, and documentation, accelerating the development process.
Enterprise customers can now access GPT-4.5 within Azure AI Foundry, unlocking its potential to reshape various facets of their operations, from customer service to internal processes. The model’s enhanced capabilities promise to drive efficiency, improve decision-making, and foster innovation.
A New Wave of Specialized AI Models
The latest advancements in AI models, showcased within Azure AI Foundry, share a common thread: a focus on delivering specialized capabilities with enhanced efficiency. This trend signals a shift towards purpose-built AI, designed to excel in specific domains while minimizing computational resource requirements. This approach allows for faster processing, reduced energy consumption, and greater cost-effectiveness. Azure AI Foundry highlights several key launches in this area:
Microsoft’s Phi Models: Pushing the Boundaries of Efficiency
Microsoft’s Phi models continue to redefine what’s possible with smaller, more efficient architectures. These models are designed to deliver high performance while requiring significantly less computational power than their larger counterparts. This makes them ideal for deployment in resource-constrained environments, such as edge devices and mobile applications.
Phi-4-multimodal: This model integrates text, speech, and vision in a single model, enabling context-aware interactions. Imagine a retail kiosk diagnosing a product issue through camera and voice input: a user shows the faulty product and describes the problem aloud, and the kiosk understands the issue and offers assistance, with no cumbersome manual description required. This multimodal capability makes interactions more natural and intuitive, bridging the gap between human and machine communication.
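A multimodal prompt like the kiosk scenario is typically expressed as a chat message whose content mixes text and image parts. The sketch below uses the OpenAI-style content schema that many Azure-hosted chat endpoints accept; whether a given Phi-4-multimodal deployment uses exactly this shape (and how audio is attached) should be checked against the service docs, and the image bytes here are a stand-in for a real camera frame.

```python
import base64
import json

# Stand-in bytes for the kiosk camera frame -- not a real photo.
fake_jpeg_bytes = b"\xff\xd8\xff\xe0" + b"\x00" * 16
image_b64 = base64.b64encode(fake_jpeg_bytes).decode("ascii")

# OpenAI-style multimodal message: a text part plus an inline base64 image part.
message = {
    "role": "user",
    "content": [
        {"type": "text",
         "text": "The power button on this device does nothing. What should I check?"},
        {"type": "image_url",
         "image_url": {"url": f"data:image/jpeg;base64,{image_b64}"}},
    ],
}
print(json.dumps(message)[:80])
```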
Phi-4-mini: Surpassing larger models in coding and math tasks, Phi-4-mini boasts a 30% increase in inference speed compared to its predecessors. This enhanced speed allows for faster processing of code and mathematical problems, making it a valuable tool for developers and researchers. Despite its smaller size, Phi-4-mini demonstrates that efficiency does not have to come at the cost of performance.
Stability AI: Advancing Generative Imaging
Stability AI propels creative workflows forward with models designed for accelerated asset generation. These models are specifically tailored for tasks such as image creation and manipulation, enabling users to generate high-quality visuals quickly and efficiently.
Stable Diffusion 3.5 Large: This model generates high-fidelity marketing assets faster than previous versions while maintaining brand consistency across diverse visual styles, letting marketers produce compelling campaign visuals without sacrificing quality.
Stable Image Ultra: This model achieves photorealism in product imagery through accurate material rendering and color fidelity, producing images that are difficult to distinguish from real photographs and reducing the need for costly photoshoots.
Stable Image Core: An enhanced iteration of SDXL (Stability AI’s text-to-image generative model), Stable Image Core delivers high-quality text-to-image output with exceptional speed and efficiency, so users can generate images quickly without compromising on quality.
Cohere: Enhancing Information Retrieval
Cohere elevates information retrieval with its latest ranking technology, designed to improve the accuracy and relevance of search results.
Cohere Rerank v3.5: This model delivers more accurate search results, leveraging a 4,096-token context window and support for over 100 languages. It excels at surfacing relevant content even without exact keyword matches, so users can find what they need without knowing the precise terms, across multilingual datasets.
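To make the reranking step concrete without calling the real service, here is a toy second-stage ranker: it scores each candidate document against the query by token overlap and returns the top results. Cohere Rerank v3.5 uses a learned cross-encoder rather than anything this naive, so treat this purely as an illustration of a reranker’s inputs and outputs.

```python
import re

def tokens(text: str) -> set:
    """Lowercase word tokens with punctuation stripped."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))

def toy_rerank(query, documents, top_n=3):
    """Score each document by the fraction of query tokens it contains, best first."""
    q = tokens(query)
    scored = [(len(q & tokens(doc)) / max(len(q), 1), doc) for doc in documents]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:top_n]

docs = [
    "Reset your password from the account settings page.",
    "Shipping times vary by region and carrier.",
    "To change your password, open settings and choose security.",
]
results = toy_rerank("how do I change my password", docs, top_n=2)
for score, doc in results:
    print(f"{score:.2f}  {doc}")
```

Note that the best match here shares no exact phrase with the query; a learned reranker generalizes far beyond such surface overlap, which is the point of the feature.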
Expanding the GPT-4o Family
The GPT-4o family grows with two specialized variants, further expanding the capabilities of this powerful model.
GPT-4o-Audio-Preview: This model handles audio prompts and generates spoken responses with appropriate emotion and emphasis. Its natural intonation makes it well suited to digital assistants and customer service applications, where engaging spoken interaction matters.
GPT-4o-Realtime-Preview: This model responds to user input in real time, with latency low enough to eliminate conversational lag, so interactions feel fluid, responsive, and genuinely human-like.
These collective advancements mark a significant evolution in AI, fostering more natural, responsive, and efficient interactions across a wide spectrum of use cases and deployment environments. The focus on specialized models and enhanced efficiency reflects the growing demand for AI solutions that are tailored to specific tasks and can be deployed in a variety of settings.
Empowering Customization with Advanced Tools
As the model library within Azure AI Foundry expands beyond 1,800 offerings, the platform continues to lead in experimentation and observability. A new suite of fine-tuning tools complements the rise of unsupervised learning techniques, providing developers with greater control over model behavior and performance. These tools allow for the customization of models to meet specific needs and requirements, further enhancing their effectiveness in various applications.
Distillation Workflows: Azure OpenAI Service introduces a code-first approach to model distillation with the Stored Completions API and SDK. Distillation transfers knowledge from a larger, more complex teacher model, such as GPT-4.5, to a smaller, more efficient student, which can then match the teacher’s performance on specific tasks at reduced cost and latency.
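Conceptually, the distillation workflow turns stored teacher completions into supervised fine-tuning examples for the student. The sketch below builds JSONL-style training records from captured prompt/completion pairs; the exact record shape a fine-tuning job expects, and how the Stored Completions API surfaces the captures, should be taken from the Azure OpenAI docs rather than from this illustration.

```python
import json

# Hypothetical captures from a large teacher model (e.g. a GPT-4.5 deployment).
stored_completions = [
    {"prompt": "Summarize: The meeting moved to Tuesday.",
     "completion": "Meeting rescheduled to Tuesday."},
    {"prompt": "Summarize: Invoice 42 was paid late.",
     "completion": "Invoice 42 paid late."},
]

def to_training_record(capture: dict) -> dict:
    """One chat-format fine-tuning example: user prompt, teacher output as target."""
    return {
        "messages": [
            {"role": "user", "content": capture["prompt"]},
            {"role": "assistant", "content": capture["completion"]},
        ]
    }

# One JSON object per line, the usual shape of a fine-tuning dataset.
jsonl = "\n".join(json.dumps(to_training_record(c)) for c in stored_completions)
print(jsonl.splitlines()[0])
```

Training the student on such records is what lets it inherit the teacher’s task-specific behavior at a fraction of the serving cost.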
Reinforcement Fine-tuning: Currently in private preview, this technique teaches models to reason in novel ways. Drawing on reinforcement learning, it rewards correct logical paths and penalizes incorrect reasoning, so the model improves its problem-solving abilities over repeated training rounds.
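The reward/penalty loop can be illustrated with a deliberately tiny example: two candidate reasoning strategies for an even/odd question, a grader that compares each answer to the gold label, and a multiplicative weight update that favors the correct path. Real reinforcement fine-tuning operates on model weights and graded reasoning traces, not a two-entry dict, so this is only a sketch of the incentive structure.

```python
# Two hypothetical "reasoning paths" for deciding whether a number is even.
strategies = {
    "check_last_bit": lambda n: n % 2 == 0,   # sound reasoning
    "check_greater_ten": lambda n: n > 10,    # spurious shortcut
}
weights = {name: 1.0 for name in strategies}

def grade(predicted: bool, gold: bool) -> float:
    """Grader: +1 for a correct logical path, -1 for an incorrect one."""
    return 1.0 if predicted == gold else -1.0

LEARNING_RATE = 0.5
training_set = [(4, True), (7, False), (12, True), (15, False), (2, True)]

for n, gold in training_set:
    for name, strategy in strategies.items():
        reward = grade(strategy(n), gold)
        # Reward multiplies the weight up; penalty multiplies it down.
        weights[name] *= (1.0 + LEARNING_RATE * reward)

best = max(weights, key=weights.get)
print(best, {k: round(v, 3) for k, v in weights.items()})
```

After five graded examples the sound strategy dominates, which is the behavior reinforcement fine-tuning induces at model scale.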
Provisioned Deployment for Fine-tuning: Azure OpenAI Service now offers Provisioned Deployments for fine-tuned models, available alongside token-based billing. Provisioned Throughput Units (PTUs) reserve a dedicated amount of compute for a fine-tuned model, ensuring predictable performance and costs, which is particularly important for applications that require high availability and low latency.
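The trade-off between the two billing models is straightforward arithmetic: reserved hourly PTU capacity wins once sustained throughput is high enough. All prices below are made-up placeholders; real PTU pricing and capacity figures come from Azure’s pricing pages.

```python
def tokens_per_hour_breakeven(ptu_count: int,
                              price_per_ptu_hour: float,
                              price_per_1k_tokens: float) -> float:
    """Hourly token volume above which provisioned capacity beats pay-per-token."""
    hourly_provisioned_cost = ptu_count * price_per_ptu_hour
    # Tokens you could buy pay-as-you-go for the same hourly spend.
    return hourly_provisioned_cost / price_per_1k_tokens * 1000

# Hypothetical numbers, for illustration only.
breakeven = tokens_per_hour_breakeven(ptu_count=50,
                                      price_per_ptu_hour=2.0,
                                      price_per_1k_tokens=0.01)
print(f"Provisioned wins above {breakeven:,.0f} tokens/hour")
```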
Fine-tuning for Mistral Models: Exclusively available in Azure AI Foundry, Mistral Large 2411 and Ministral 3B now support fine-tuning for industry-specific tasks; one example is healthcare document redaction. Fine-tuning lets developers adapt Mistral models to specific tasks and datasets, improving accuracy in those areas and making the models applicable to a wider range of real-world scenarios.
These advanced tools empower developers to tailor AI models to their specific needs, optimizing performance and efficiency for a variety of applications. The combination of unsupervised learning techniques and fine-tuning capabilities provides a powerful toolkit for creating customized AI solutions.
Fortifying Enterprise Agents with Security and Scalability
In today’s enterprise landscape, security and scalability are not just desirable features – they are strategic imperatives. AI systems must be able to handle large volumes of data and complex tasks while maintaining the highest levels of security and data privacy. Azure AI Foundry introduces two powerful features to securely harness AI for mission-critical tasks:
Bring Your Own VNet: Azure AI Agent Service now enables all AI agent interactions, data processing, and API calls to remain within an organization’s own virtual network, eliminating exposure to the public internet and shielding sensitive data from external threats. Early adopters like Fujitsu are leveraging this capability: their sales proposal creation agent, built on the service, has increased productivity by 67% while saving countless hours, freeing resources for customer engagement and strategic planning without compromising data integrity.
Magma: Available through Azure AI Foundry Labs, Magma is a foundation model for multimodal AI agents, built to perceive and act across both digital and physical environments. By grounding language understanding in visual and spatial context, it can drive agents that operate software interfaces as well as real-world tasks, effectively bridging the physical and digital agentic worlds, and it is open for experimentation today.
The introduction of these features, and the improvement of existing ones, is a testament to Microsoft’s commitment to providing secure, scalable, and reliable AI solutions for enterprise customers. That focus on security and scalability means AI can be deployed in mission-critical applications with confidence, with sensitive data protected and performance dependable.