La Cina controlla gran parte delle catene di approvvigionamento delle batterie, una preoccupazione che sta diventando sempre più critica per le forze armate statunitensi e le iniziative di intelligenza artificiale.
Elon Musk claims that xAI will have more computing power than everyone else combined within five years
Meta has announced the launch of its new AI-assisted healthcare model, Erkang-Diagnosis-1.1. The model combines a hybrid approach with pre-training and return generation to create a secure, reliable, and professional AI health advisor.
Researchers have developed MicroProbe, a new technology that enables reliability assessment of foundation models using only minimal data.
Researchers have developed a new technology that enables language models to better understand context and relationships between concepts. This innovation could revolutionize the approach to text comprehension problems.
Artificial Intelligence is revolutionizing smart home lighting optimization. The new BitRL-Light model combines Llama with the Deep Q-Network to optimize energy consumption and improve user comfort.
Google allows users to change their Gmail address without creating a new account
La valutazione dei grandi modelli linguistici (LLM) si basa pesantemente su benchmarks standardizzati. Questi benchmarks offrono metriche aggregate utili per una data capacità, ma queste metriche aggregate possono nascondere (i) aree particolari dove i modelli sono deboli ('lacune del modello') e (ii) distorsioni nella copertura dei benchmark stessi ('lacune del benchmark'). Presentiamo un nuovo metodo che utilizza autoencoditori sparsi (SAEs) per scoprire automaticamente entrambi tipi di lacuna. Sfruttando le attivazioni concettuali degli SAE e calcolando i punteggi dei prestazioni salienza-weighted in base a dati benchmark, il metodo pone l'evaluzione sulle rappresentazioni interne del modello ed permette una comparazione tra i benchmarks.
Predicting treatment outcomes for lung cancer remains a challenge due to the sparsity, heterogeneity, and information overload of real-world electronic health data.
This study proposes a multi-agent language framework that enables continual strategy evolution without fine-tuning the language model's parameters. The core idea is to liberate the latent vectors of abstract concepts from traditional static semantic representations, allowing them to be continuously updated through environmental interaction and reinforcement feedback.
A recent study analyzes the stability of transformer-based sentiment models on their ability to adapt to temporal changes in social media flows. The results show significant model instability with accuracy drops reaching 23.4% during event-driven periods. The author proposes four new drift metrics validated on 12,279 authentic social media posts, achieving promising results for production deployment.
A new approach for creating more realistic user simulators that enhance the safety and effectiveness of mental health support chatbots.
The X company has announced today the launch of SA-DiffuSeq, a new approach for long document generation that addresses computational and scalability challenges. The new framework integrates sparse attention to improve sampling efficiency and precision in long-range dependency modeling.
Un nuovo approccio per i modelli neurali controllati differenziali (Neural CDEs) potrebbe rivoluzionare il campo dell'intelligenza artificiale. Questo metodo, che richiede molto meno parametri rispetto agli attuali modelli, offre una soluzione innovativa per analizzare sequenze temporali.
A recent study examined the behavior of large language models in mathematics education compared to expert human tutors. The results show that these models have a similar level of pedagogical quality as experts but use different teaching strategies and linguistic approaches.
A new platform, TokSuite, has been created to study the role of tokenizers in improving LLM models. This technology allows researchers to delve deeper into the impact of tokenizers on model performance.
Recently discovered spurious forgetting is a fundamental challenge for language models. Continual learning is a technique that enables models to adapt to new information, but spurious forgetting can lead to performance degradation.
This article explores the future development of trains in Italy, discussing current trends and innovations that will shape the industry.
Hospitals are losing millions of euros to operating room inefficiencies, but AI is the key to solving complex coordination issues.
Waymo is testing Gemini-powered in-car AI assistant for its robotaxis