Astronomer Builds on Strong Customer Momentum, Appoints New SVP of R&D
Astronomer, the company behind Astro, the leading fully managed service for Apache Airflow, today announced significant company and customer momentum as demand for data orchestration tools surges to fuel data…
TopicGPT: A Prompt-based AI Framework that Uses Large Language Models (LLMs) to Uncover Latent Topics in a Text Collection
Topic modeling is a technique to uncover the underlying thematic structure in large text corpora. Traditional topic modeling methods, such as Latent Dirichlet Allocation (LDA), have limitations in terms of…
This AI Paper Presents a Direct Experimental Comparison between 8B-Parameter Mamba, Mamba-2, Mamba-2-Hybrid, and Transformer Models Trained on Up to 3.5T Tokens
Transformer-based Large Language Models (LLMs) have emerged as the backbone of Natural Language Processing (NLP). These models have shown remarkable performance over a variety of NLP tasks. The creative self-attention…
Enhancing Mathematical Reasoning in LLMs: Integrating Monte Carlo Tree Search with Self-Refinement
With the rapid advancements in artificial intelligence, LLMs such as GPT-4 and LLaMA have significantly enhanced natural language processing. These models, boasting billions of parameters, excel in understanding and generating…
Microsoft Research Launches AutoGen Studio: A Low-Code Platform Revolutionizing Multi-Agent AI Workflow Development and Deployment
Microsoft Research has announced the release of AutoGen Studio, a low-code interface designed to streamline the creation, testing, and deployment of multi-agent AI workflows. Building on the success of the…
Meet DeepSeek-Coder-V2 by DeepSeek AI: The First Open-Source AI Model to Surpass GPT4-Turbo in Coding and Math, Supporting 338 Languages and 128K Context Length
Code intelligence focuses on creating advanced models capable of understanding and generating programming code. This interdisciplinary area leverages natural language processing and software engineering to enhance programming efficiency and accuracy.…
Advances in Bayesian Deep Neural Network Ensembles and Active Learning for Preference Modeling
Machine learning has seen significant advancements in integrating Bayesian approaches and active learning methods. Two notable research papers contribute to this development: “Bayesian vs. PAC-Bayesian Deep Neural Network Ensembles” by…
NVIDIA AI Releases HelpSteer2 and Llama3-70B-SteerLM-RM: An Open-Source Helpfulness Dataset and a 70 Billion Parameter Language Model Respectively
NVIDIA recently announced the release of two groundbreaking technologies in artificial intelligence: HelpSteer2 and Llama3-70B-SteerLM-RM. These innovations promise to significantly enhance the capabilities of AI systems in various applications, from…
IBM and Wimbledon Collaborate to Provide Tennis Fans with Enhanced Digital Experience
IBM and The All England Lawn Tennis Club (AELTC) are collaborating to offer fans a more immersive and dynamic Wimbledon digital experience through a new feature called “Catch Me Up”.…
GARP Launches Certificate Program on Risk and Artificial Intelligence
New Risk and AI (RAI) Certificate is designed to help professionals understand and manage the risks associated with AI. The Global Association of Risk Professionals (GARP) announced the launch of…