Phi 4

Phi 4 is a language model designed for GPU-based chat completion tasks and is available through Microsoft Foundry Local. It is part of the Phi family of models developed by Microsoft for local deployment scenarios.

Deployment and Availability

The model is accessed through Microsoft Foundry Local, which provides infrastructure for running language models on GPU hardware. This deployment approach enables users to run chat completion workloads locally rather than relying solely on cloud-based inference endpoints.

Use Cases

Phi 4 is configured for conversational AI applications, supporting chat completion tasks where it can generate contextual responses to user inputs. Its local deployment capability makes it suitable for applications requiring on-device inference or those with specific data residency requirements.

Source Notes