Business
Microsoft launches robust AI 'small language model' for researchers
New Delhi, Dec 17
Microsoft has released its newest compact “small language model†titled Phi-2 that continues to perform at par or better than certain larger open-source Llama 2 models with less than 13 billion parameters.
Over the past few months, the Machine Learning Foundations team at Microsoft Research has released a suite of small language models (SLMs) called “Phi†that achieve remarkable performance on a variety of benchmarks.
The first model, the 1.3 billion parameter Phi-1 achieved state-of-the-art performance on Python coding among existing SLMs (specifically on the HumanEval and MBPP benchmarks).
"We are now releasing Phi-2, a 2.7 billion-parameter language model that demonstrates outstanding reasoning and language understanding capabilities, showcasing state-of-the-art performance among base language models with less than 13 billion parameters,†the company said in an update.
Phi-2 is an ideal playground for researchers, including for exploration around mechanistic interpretability, safety improvements, or fine-tuning experimentation on a variety of tasks.
“We have made Phi-2 available in the Azure AI Studio model catalog to foster research and development on language models,†said Microsoft.
The massive increase in the size of language models to hundreds of billions of parameters has unlocked a host of emerging capabilities that have redefined the landscape of natural language processing.
However, a question remains whether such emergent abilities can be achieved at a smaller scale using strategic choices for training, e.g., data selection.
“Our line of work with the Phi models aims to answer this question by training SLMs that achieve performance on par with models of much higher scale (yet still far from the frontier models),†said Microsoft.
The company has also performed extensive testing on commonly used prompts from the research community.
“We observed a behaviour in accordance with the expectation we had given the benchmark results,†said the tech giant.
9 hours ago
Trump, Mamdani bonhomie an unusual photo-op in Oval Office, but how long will truce last?
9 hours ago
Trump Jr grooves with Ranveer Singh at lavish Udaipur wedding as JLo, Bieber join celebrations
12 hours ago
BAPS, United Nations celebrate 30 years of transformative partnership for global harmony
15 hours ago
Ayan Mukerji says 'Love you & Miss you' as he remembers dad Deb Mukherjee on his birth anniversary
15 hours ago
Urmila Matondkar introduces her 'bestest winter essential'
15 hours ago
Tharoor cites Trump-Mamdani interaction to underline need for political cooperation
15 hours ago
ISI steps up effort to build white-collared modules by targeting Indian students abroad
15 hours ago
Prez Murmu participates in Sri Sathya Sai Baba’s birth centenary celebrations in Andhra
15 hours ago
Ready to meet PM Modi to explain Coimbatore, Madurai metro projects: CM Stalin
15 hours ago
No need to do politics on Mandir–Masjid: Former Babri mosque litigant on Trinamool MLA’s remark
15 hours ago
Navy Day 2025 to feature grand operational display of maritime power on Dec 3
15 hours ago
Delhi: AGS arrests accused wanted in attempt-to-murder case in Timarpur
15 hours ago
Govt to ensure uniform safety and health standards for workers
