• Thu. Nov 28th, 2024

Innodata’s Comprehensive Benchmarking of Llama2, Mistral, Gemma, and GPT for Factuality, Toxicity, Bias, and Hallucination Propensity

In a recent study by Innodata, various large language models (LLMs) such as Llama2, Mistral, Gemma, and GPT were benchmarked for their performance in factuality, toxicity, bias, and propensity for…

This AI Research from Tenyx Explores the Reasoning Abilities of Large Language Models (LLMs) Through Their Geometrical Understanding

Large language models (LLMs) have demonstrated remarkable performance across various tasks, with reasoning capabilities being a crucial aspect of their development. However, the key elements driving these improvements remain unclear.…

D-Rax: Enhancing Radiologic Precision through Expert-Integrated Vision-Language Models

VLMs like LLaVA-Med have advanced significantly, offering multi-modal capabilities for biomedical image and data analysis, which could aid radiologists. However, these models face challenges, such as hallucinations and imprecision in…

Researchers at the University of Manchester Propose ESBMC-Python: The First BMC-based Python-code Verifier for Formal Verification of Python Programs

Formal verification is crucial in software engineering to ensure program correctness through mathematical proof. One widely used technique for this purpose is bounded model checking (BMC), which involves verifying the…

Generative AI to Account for 1.5% of World’s Power Consumption by 2029

Generative AI will account for a larger share of the world’s power consumption as the hardware needed to run its applications grows more demanding. “AI chips represent 1.5% of electricity use over the…

This AI Paper from Cohere for AI Presents a Comprehensive Study on Multilingual Preference Optimization

Multilingual natural language processing (NLP) is a rapidly advancing field that aims to develop language models capable of understanding and generating text in multiple languages. These models facilitate effective communication…

Yes, Big Data Is Still a Thing (It Never Really Went Away)

A funny thing happened on the way to the AI promised land: People realized they need data. In fact, they realized they need large quantities of a wide variety of…

aiOla’s AI adapts to any industry’s jargon with additional training

aiOla’s proprietary model significantly improves on OpenAI’s Whisper, achieving a 45% increase in speech recognition accuracy when transcribing domain-specific dialogue. aiOla, a leader in speech recognition technology, has announced a new…

Salesforce Research Introduces INDICT: A Groundbreaking Framework Enhancing the Safety and Helpfulness of AI-Generated Code Across Diverse Programming Languages

The ability to automate and assist in coding has the potential to transform software development, making it faster and more efficient. However, ensuring these models produce helpful and secure code…

This Machine Learning Research Attempts to Formalize Generalization in the Context of GFlowNets and to Link Generalization with Stability

Generative Flow Networks (GFlowNets) address the complex challenge of sampling from unnormalized probability distributions in machine learning. By learning a policy on a constructed graph, GFlowNets facilitate efficient sampling through…