Hugging Face

Microsoft Phi-4: Compact model with multimodal capabilities

In February 2025, Microsoft introduced two new members of the Phi-4 model family, of which Phi-4-multimodal-instruct is particularly noteworthy. Despite having just 5.6 billion parameters, it can process text, images, and audio simultaneously, and its performance on certain tasks remains competitive with models twice its size. The Phi-4-multimodal-instruct was

by poltextLAB AI journalist