Microsoft releases Phi-2, a powerful small language model AI

Key Takeaways:

– Microsoft Research has announced the release of its Phi-2 small language model (SML), a text-to-text AI program that can run on a laptop or mobile device.
– Phi-2 has 2.7 billion parameters and performs comparably to larger models like Meta’s Llama 2-7B and Mistral-7B.
– Phi-2 outperforms Google’s Gemini Nano 2 model in terms of toxicity and bias in responses.
– Phi-2 was able to correctly answer a physics question prompt and correct a student’s mistake, similar to Google’s Gemini Ultra model.
– However, Phi-2 is currently only licensed for research purposes and cannot be used commercially.

VentureBeat:

Are you ready to bring more awareness to your brand? Consider becoming a sponsor for The AI Impact Tour. Learn more about the opportunities here.


The rapid pace of generative AI news and announcements isn’t slowing down, even as we reach the final stretches of 2023 and the traditional winter holiday quiet period.

Just take a look at Microsoft Research, the blue sky division of the software giant, which today announced the release of its Phi-2 small language model (SML), a text-to-text AI program that is “small enough to run on a laptop or mobile device,” according to a post on X.

At the same time, Phi-2 with its 2.7 billion parameters (connections between artificial neurons) boasts performance that is comparable to other, much larger models including Meta’s Llama 2-7B with its 7 billion parameters and even Mistral-7B, another 7 billion parameter model.

Chart comparing Microsoft Research Phi-2 model to other leading open source and closed source models. Credit: Microsoft Research

Microsoft researchers also noted in their blog post on the Phi-2 release that it outperforms Google’s brand new Gemini Nano 2 model despite it having half a billion more parameters, and delivers less “toxicity” and bias in its responses than Llama 2.

VB Event

The AI Impact Tour

Connect with the enterprise AI community at VentureBeat’s AI Impact Tour coming to a city near you!

 


Learn More

Microsoft also couldn’t resist taking a little dig at Google’s now much-criticized, staged demo video for Gemini in which it showed off how its forthcoming largest and most powerful new AI model, Gemini Ultra, was able to solve fairly complex physics problems and even correct students’ mistakes on them. As it turned out, even though it is likely a fraction of the size of Gemini Ultra, Phi-2 also was able to correctly answer the question and correct the student using the same prompts.

Promotional screenshot showing Phi-2’s answer to a physics question prompt. Credit: Microsoft Research

However, despite these encouraging findings, there is a big limitation with Phi-2, at least for the time being: it is licensed only for “research purposes only,” not commercial usage, under a custom Microsoft Research License, which further states Phi-2 may only be used for “non-commercial, non-revenue generating, research purposes.” So, businesses looking to build products atop it are out of luck.

VentureBeat’s mission is to be a digital town square for technical decision-makers to gain knowledge about transformative enterprise technology and transact. Discover our Briefings.


Source link

AI Eclipse TLDR:

Microsoft Research has announced the release of its Phi-2 small language model (SML), a text-to-text AI program that is small enough to run on a laptop or mobile device. Despite its smaller size, Phi-2 boasts performance comparable to much larger models, including Meta’s Llama 2-7B and Mistral-7B. Microsoft researchers stated that Phi-2 outperforms Google’s Gemini Nano 2 model and delivers less toxicity and bias in its responses than Llama 2. However, Phi-2 is currently licensed only for research purposes and cannot be used for commercial applications.