Pinokio 2.0: A New Pinokio Browser Version that Lets You Locally Install, Run, and Automate Any AI on Your Computer
Using offline web apps and AI apps often comes with challenges. Users typically need to navigate multiple steps to get an app running. These steps can be confusing and time-consuming,…
NeedleBench: A Customizable Dataset Framework that Includes Tasks for Evaluating the Bilingual Long-Context Capabilities of LLMs Across Multiple Length Intervals
Evaluating the retrieval and reasoning capabilities of large language models (LLMs) in extremely long contexts, extending up to 1 million tokens, is a significant challenge. Efficiently processing long texts is…
EM-LLM: A Novel and Flexible Architecture that Integrates Key Aspects of Human Episodic Memory and Event Cognition into Transformer-based Language Models
Despite their expanding capabilities, large language models (LLMs) need help with processing extensive contexts. These limitations stem from Transformer-based architectures struggling to extrapolate beyond their training window size. Processing long…
Is Generative AI Boosting Individual Creativity but Reducing Collective Novelty?
Innovation and the artistic, musical, and literary expression of human experiences and emotions depend on creativity. However, the idea that material created by humans is inherently better is coming under…
Q-Sparse: A New Artificial Intelligence AI Approach to Enable Full Sparsity of Activations in LLMs
LLMs excel in natural language processing tasks but face deployment challenges due to high computational and memory demands during inference. Recent research [MWM+24, WMD+23, SXZ+24, XGZC23, LKM23] aims to enhance…
Snowflake-Arctic-Embed-m-v1.5 Released: A 109M Parameters Groundbreaking Text Embedding Model with Enhanced Compression and Performance Capabilities
Snowflake recently announced the release of its updated text embedding model, snowflake-arctic-embed-m-v1.5. This model generates highly compressible embedding vectors while maintaining high performance. The model’s most noteworthy feature is its…
From Diagrams to Solutions: MAVIS’s Three-Stage Framework for Mathematical AI
Large Language Models (LLMs) and their multi-modal counterparts (MLLMs) have made significant strides in advancing artificial general intelligence (AGI) across various domains. However, these models face a significant challenge in…
Mistral AI and NVIDIA unveil 12B NeMo model
Mistral AI has announced NeMo, a 12B model created in partnership with NVIDIA. This new model boasts an impressive context window of up to 128,000 tokens and claims state-of-the-art performance…
TSMC forecasts record growth, rejects US joint venture amid AI surge
Taiwan Semiconductor Manufacturing Company (TSMC) has raised its revenue forecast for 2024, citing strong demand for chips in AI applications. The world’s largest contract chipmaker anticipates growth slightly above the…
MMLongBench-Doc: A Comprehensive Benchmark for Evaluating Long-Context Document Understanding in Large Vision-Language Models
Document understanding (DU) focuses on the automatic interpretation and processing of documents, encompassing complex layout structures and multi-modal elements such as text, tables, charts, and images. This task is essential…