AIREVOLUTION

The Amazon Nova Sonic Model Simplifies Real-time Voice-based Interactions

On 8 April 2025, Amazon announced the Nova Sonic foundation model, which combines speech understanding and speech generation into a single model, enabling more human-like voice-based conversations in AI applications. This new technology not only comprehends what is said but also how it is said—including tone, style, and speech

by poltextLAB AI journalist • Apr 30, 2025

DeepSeek research results LLM

DeepSeek's New Development Targets General and Highly Scalable AI Reward Models

On 8 April 2025, Chinese DeepSeek AI introduced its novel technology, Self-Principled Critique Tuning (SPCT), marking a significant advancement in the reward mechanisms of large language models. SPCT is designed to enhance AI models’ performance in handling open-ended, complex tasks, particularly in scenarios requiring nuanced interpretation of context and user

by poltextLAB AI journalist • Apr 28, 2025

Meta Llama LLM

Meta Unveiled its New Open-Source Multimodal Llama 4 Models

On 5 April 2025, Meta announced its most advanced large language model, Llama 4, which the company says marks the dawn of a new era in multimodal AI innovation. The new model family debuted with two main variants: Llama 4 Scout and Llama 4 Maverick, capable of processing and integrating

by poltextLAB AI journalist • Apr 24, 2025

DeepSeek Claude LLM

DeepSeek’s 685 billion parameter model is competing with Claude 3.7

DeepSeek AI released its latest 685 billion parameter DeepSeek-V3-0324 model on 24 March 2025, positioning it as an open-source alternative to compete with Anthropic’s Claude 3.7 Sonnet model. The new model demonstrates significant advancements in coding, mathematical tasks, and general problem-solving, while being freely available under an MIT

by poltextLAB AI journalist • Apr 11, 2025

Gemini LLM

Google Has Introduced a New Model Family: Gemini 2.5, the Company’s Most Advanced Reasoning Model to Date

Google unveiled the Gemini 2.5 artificial intelligence model family on March 25, 2025, representing the company’s most advanced reasoning AI system to date. The first released version, Gemini 2.5 Pro Experimental, is capable of reasoning before responding, significantly improving performance and accuracy. The model is already available

by poltextLAB AI journalist • Apr 8, 2025

European developments LLM

EuroBERT: A Next-Generation Multilingual Encoder Model Family for Language Technology

EuroBERT, the newly developed multilingual encoder model family, marks a significant advancement in modern language technology. It enables more efficient processing of 15 European and global languages, handling sequences of up to 8,192 tokens. Officially introduced on 10 March 2025, the EuroBERT family was trained on a dataset of

by poltextLAB AI journalist • Apr 7, 2025

Tencent China LLM

Tencent Has Unveiled a New Model: 44% Faster Response Time and Double the Word Generation Speed

On 27 February 2025, Chinese tech giant Tencent unveiled its latest “fast-thinking” artificial intelligence model, the Hunyuan Turbo S. Compared to the DeepSeek R1 model, it boasts a 44% reduction in response time and twice the word generation speed. The new model adopts an innovative Hybrid-Mamba-Transformer architecture, which significantly reduces

by poltextLAB AI journalist • Apr 4, 2025

Microsoft Hugging Face LLM

Microsoft Phi-4: Compact model with multimodal capabilities

In February 2025, Microsoft introduced two new members of the Phi-4 model family, with the Phi-4-multimodal-instruct being particularly noteworthy. Despite having just 5.6 billion parameters, it can simultaneously process text, images, and audio, while its performance in certain tasks remains competitive with models twice its size. The Phi-4-multimodal-instruct was

by poltextLAB AI journalist • Mar 31, 2025

Hungarian developments research results LLM

Corpus Size vs Quality: New Research on the Efficiency of Hungarian Language Models

Hungarian language technology research has reached a significant milestone: a comprehensive study has revealed that a larger corpus size does not necessarily lead to improved performance in morphological analysis. In their study, Andrea Dömötör, Balázs Indig, and Dávid Márk Nemeskey conducted a detailed analysis of three Hungarian-language corpora of varying

by poltextLAB AI journalist • Mar 28, 2025

Mistral European developments LLM

Mistral Unveils Its Enhanced Small 3.1 Multimodal Model

Released on 17 March 2025 under the Apache 2.0 licence, Mistral’s Small 3.1 model marks a significant advancement in artificial intelligence technology. This 24-billion-parameter model delivers improved text generation performance, enhanced multimodal capabilities, and an expanded context window of 128,000 tokens, while maintaining an inference speed

by poltextLAB AI journalist • Mar 28, 2025

Hungarian developments research results LLM

Context Sensitivity of the huBERT Model in Pragmatic Annotation – New Research Findings

Tibor Szécsényi and Nándor Virág, researchers at the University of Szeged, have explored the context sensitivity of the huBERT language model in pragmatic annotation, focusing in particular on the automatic identification of imperative verb functions. Their study, conducted on the MedCollect corpus—a dataset of health-related misinformation—investigates how both

by poltextLAB AI journalist • Mar 27, 2025

Baidu DeepSeek LLM

Baidu Unveils Its New AI Models: ERNIE 4.5 and ERNIE X1

On 16 March 2025, Baidu unveiled its two latest artificial intelligence models: the ERNIE 4.5 multimodal foundation model and the ERNIE X1 reasoning model. These new models represent a significant leap forward in multimodal and reasoning AI while offering solutions that are just a fraction of the cost of

by poltextLAB AI journalist • Mar 26, 2025