Marktechpost AI Research News ⚡ Profile Banner
Marktechpost AI Research News ⚡ Profile
Marktechpost AI Research News ⚡

@Marktechpost

7,361
Followers
1,134
Following
2,245
Media
9,608
Statuses

🐝 AI/ML Research and Dev News Platform (1 million+monthly traffic) | 50k+ ML subreddit | Contact: Asif @marktechpost .com

What is trending in AI?
Joined April 2016
Don't wanna be here? Send us removal request.
@Marktechpost
Marktechpost AI Research News ⚡
6 months
ScrapeGraphAI: A Web Scraping Python Library that Uses LLMs to Create Scraping Pipelines for Websites, Documents, and XML Files Quick read: Github: Colab Notebook: @LangChainAI #artificalintelligence
Tweet media one
2
38
167
@Marktechpost
Marktechpost AI Research News ⚡
3 months
LAMBDA: A New Open-Source, Code-Free Multi-Agent Data Analysis System to Bridge the Gap Between Domain Experts and Advanced AI Models A team of researchers from Hong Kong Polytechnic University has introduced LAMBDA, a new open-source and code-free multi-agent data analysis
Tweet media one
1
55
150
@Marktechpost
Marktechpost AI Research News ⚡
5 months
Researchers at NVIDIA AI Introduce ‘VILA’: A Vision Language Model that can Reason Among Multiple Images, Learn in Context, and Even Understand Videos Quick read: Researchers from NVIDIA and MIT have introduced a novel visual language model (VLM)
Tweet media one
3
34
109
@Marktechpost
Marktechpost AI Research News ⚡
7 months
HuggingFace Introduces Quanto: A Python Quantization Toolkit to Reduce the Computational and Memory Costs of Evaluating Deep Learning Models Quick read: Github: #ArtificialIntelligence
1
31
92
@Marktechpost
Marktechpost AI Research News ⚡
1 year
This AI Paper Proposes a NeRF-based Mapping Method that Enables Higher-Quality Reconstruction and Real-Time Capability Even on Edge Computers Quick Read: Paper: Github: If you like our work, you will love
0
26
89
@Marktechpost
Marktechpost AI Research News ⚡
1 year
1/4 🧵 A new research introduces AttrPrompt, a Language Model as Training Data Generator. This is a game-changer for Zero-Shot Learning, a paradigm that allows AI to understand tasks it's never seen before. 🚀 @yue___yu Quick Read:
3
36
89
@Marktechpost
Marktechpost AI Research News ⚡
5 months
UC Berkeley Researchers Introduce Learnable Latent Codes as Bridges (LCB): A Novel AI Approach that Combines the Abstract Reasoning Capabilities of Large Language Models with Low-Level Action Policies Researchers from the University of California, Berkeley, introduced Latent
1
27
88
@Marktechpost
Marktechpost AI Research News ⚡
6 months
Google AI Introduces CodecLM: A Machine Learning Framework for Generating High-Quality Synthetic Data for LLM Alignment Quick read: Researchers at Google Cloud AI have developed CodecLM, an innovative framework designed to align LLMs with specific user
2
31
86
@Marktechpost
Marktechpost AI Research News ⚡
1 year
How Can Robots Make Better Decisions? MIT and Stanford Researchers Introduce Diffusion-CCSP for Advanced Robotic Reasoning and Planning Quick Read: Paper: Project: If you like our work, you will love our
2
29
79
@Marktechpost
Marktechpost AI Research News ⚡
10 months
Microsoft Researchers Introduce PromptBench: A Pytorch-based Python Package for Evaluation of Large Language Models (LLMs) Quick read: Paper: Github: #ArtificialInteligence #MachineLearning #neural
Tweet media one
0
30
78
@Marktechpost
Marktechpost AI Research News ⚡
1 year
Cerebras Introduces the Bittensor Language Model Named BTLM-3B-8K: A New State-of-The-Art 3B Parameter Open-Source Language Model Quick Read: Paper: Project: If you like our work, you will love our
Tweet media one
6
25
79
@Marktechpost
Marktechpost AI Research News ⚡
2 months
MedGraphRAG: An AI Framework for Improving the Performance of LLMs in the Medical Field through Graph Retrieval Augmented Generation (RAG) A team of researchers from the University of Oxford has developed a unique AI framework called MedGraphRAG to improve Large Language Models’
Tweet media one
2
24
75
@Marktechpost
Marktechpost AI Research News ⚡
8 months
Google DeepMind Introduces Tandem Transformers for Inference Efficient Large Language Models LLMs Quick read: Paper: #ArtificialIntelligence
Tweet media one
2
23
69
@Marktechpost
Marktechpost AI Research News ⚡
2 months
This AI Paper by Meta FAIR Introduces MoMa: A Modality-Aware Mixture-of-Experts Architecture for Efficient Multimodal Pre-training Researchers at Meta introduced MoMa, a novel modality-aware mixture-of-experts (MoE) architecture designed to pre-train mixed-modal, early-fusion
Tweet media one
1
23
70
@Marktechpost
Marktechpost AI Research News ⚡
8 months
Transform Your Understanding of Attention: EPFL’s Cutting-Edge Research Unlocks the Secrets of Transformer Efficiency! Quick read: A groundbreaking study conducted by researchers from the Statistical Physics of Computation Laboratory and the Information
Tweet media one
0
21
68
@Marktechpost
Marktechpost AI Research News ⚡
5 months
Aloe: A Family of Fine-tuned Open Healthcare LLMs that Achieves State-of-the-Art Results through Model Merging and Prompting Strategies Researchers from the Barcelona Supercomputing Center (BSC) and Universitat Politècnica de Catalunya – Barcelona Tech (UPC) have developed the
Tweet media one
6
18
67
@Marktechpost
Marktechpost AI Research News ⚡
6 months
Researchers at Stanford University Introduce Octopus v2: Empowering On-Device Language Models for Super Agent Functionality Quick read: Researchers from Stanford University have introduced Octopus v2, an advanced on-device language model aimed at
Tweet media one
1
14
67
@Marktechpost
Marktechpost AI Research News ⚡
7 months
AgentLite by Salesforce AI Research: Transforming LLM Agent Development with an Open-Source, Lightweight, Task-Oriented Library for Enhanced Innovation Quick read: A research team from Salesforce AI Research presents AgentLite, an open-source AI Agent
1
19
64
@Marktechpost
Marktechpost AI Research News ⚡
7 months
Apple Researchers Propose a Multimodal AI Approach to Device-Directed Speech Detection with Large Language Models Quick read: Paper: #ArtificialIntelligence
Tweet media one
1
18
62
@Marktechpost
Marktechpost AI Research News ⚡
2 months
OpenResearcher: An Open-Source Project that Harnesses AI to Accelerate Scientific Research Researchers from Shanghai Jiao Tong University, Shanghai Artificial Intelligence Laboratory, Fudan University, The Hong Kong Polytechnic University, Hong Kong University of Science and
Tweet media one
1
23
65
@Marktechpost
Marktechpost AI Research News ⚡
8 months
Meta AI Introduces Searchformer for Improving Planning Efficiency: A Transformer Model for Complex Decision-Making Tasks Quick read: The research team at Meta has introduced Searchformer, a novel Transformer model that significantly improves planning
Tweet media one
0
24
62
@Marktechpost
Marktechpost AI Research News ⚡
1 year
This AI Paper Introduces DSPy: A Programming Model that Abstracts Language Model Pipelines as Text Transformation Graphs Quick Read: Paper: Github: If you like our work, you will love our newsletter:
Tweet media one
1
22
58
@Marktechpost
Marktechpost AI Research News ⚡
2 months
Comparative Evaluation of SAM2 and SAM1 for 2D and 3D Medical Image Segmentation: Performance Insights and Transfer Learning Potential Researchers from the University Health Network and the University of Toronto have comprehensively evaluated the Segment Anything Model 2 (SAM2)
1
16
60
@Marktechpost
Marktechpost AI Research News ⚡
6 months
Llama-3-based OpenBioLLM-Llama3-70B and 8B: Outperforming GPT-4, Gemini, Meditron-70B, Med-PaLM-1 and Med-PaLM-2 in Medical-Domain Quick read: Open Medical-LLM Leaderboard: OpenBioLLM-70B project page:
Tweet media one
0
18
60
@Marktechpost
Marktechpost AI Research News ⚡
6 months
Google DeepMind Presents Mixture-of-Depths: Optimizing Transformer Models for Dynamic Resource Allocation and Enhanced Computational Sustainability Quick read: Researchers from Google DeepMind, McGill University, and Mila have introduced a groundbreaking
1
14
57
@Marktechpost
Marktechpost AI Research News ⚡
16 days
Ovis-1.6: An Open-Source Multimodal Large Language Model (MLLM) Architecture Designed to Structurally Align Visual and Textual Embeddings Researchers team from Alibaba Group and Nanjing University introduced a new version of Ovis: Ovis 1.6 is a new multimodal large language
Tweet media one
0
17
58
@Marktechpost
Marktechpost AI Research News ⚡
7 months
Adaptive-RAG: Enhancing Large Language Models by Question-Answering Systems with Dynamic Strategy Selection for Query Complexity Quick read: Researchers from the School of Computing and Graduate School of AI, Korea Advanced Institute of Science and
1
18
56
@Marktechpost
Marktechpost AI Research News ⚡
2 months
Microsoft Researchers Combine Small and Large Language Models for Faster, More Accurate Hallucination Detection Researchers from Microsoft Responsible AI present a robust workflow to address the challenges of hallucination detection in LLMs. This approach aims to balance latency
Tweet media one
0
12
57
@Marktechpost
Marktechpost AI Research News ⚡
2 months
Google DeepMind Researchers Introduce Diffusion Augmented Agents: A Machine Learning Framework for Efficient Exploration and Transfer Learning Researchers from Imperial College London and Google DeepMind have introduced the Diffusion Augmented Agents (DAAG) framework to address
Tweet media one
0
20
57
@Marktechpost
Marktechpost AI Research News ⚡
2 months
Crab Framework Released: An AI Framework for Building LLM Agent Benchmark Environments in a Python-Centric Way Researchers from KAUST, , UTokyo, CMU, Stanford, Harvard, Tsinghua, SUSTech, and Oxford have developed the Crab framework, a novel benchmarking
Tweet media one
1
14
55
@Marktechpost
Marktechpost AI Research News ⚡
2 months
MegaAgent: A Practical AI Framework Designed for Autonomous Cooperation in Large-Scale LLM Agent Systems Researchers from the National University of Singapore, Shanghai Jiao Tong University, the University of California, Berkeley, and the South China University of Technology
Tweet media one
2
24
54
@Marktechpost
Marktechpost AI Research News ⚡
1 month
Google DeepMind Researchers Propose GenRM: Training Verifiers with Next-Token Prediction to Leverage the Text Generation Capabilities of LLMs Researchers from Google DeepMind, University of Toronto, MILA and UCLA have introduced a novel approach called Generative Reward Modeling
Tweet media one
0
13
53
@Marktechpost
Marktechpost AI Research News ⚡
3 months
Internet of Agents (IoA): A Novel Artificial Intelligence AI Framework for Agent Communication and Collaboration Inspired by the Internet 🌐 Internet-Inspired Architecture: Just like how the internet connects people, IoA can connect different AI agents across different
Tweet media one
0
16
51
@Marktechpost
Marktechpost AI Research News ⚡
5 months
Optimizing Agent Planning: A Parametric AI Approach to World Knowledge Quick read: Paper:
Tweet media one
0
12
52
@Marktechpost
Marktechpost AI Research News ⚡
7 months
Google DeepMind Researchers Introduce TacticAI: A New Deep Learning System that is Reinventing Football Strategy Quick read: Football has always been a game of tactical brilliance and strategic genius. From the dugouts of your local parks to the hallowed
1
17
45
@Marktechpost
Marktechpost AI Research News ⚡
2 years
Meet MAGVIT: A Novel Masked Generative Video Transformer To Address AI Video Generation Tasks Quick Read: #artificalintelligence #ArtificialIntelligence #bigdata #MachineLearning #TechNews #Trending
1
14
49
@Marktechpost
Marktechpost AI Research News ⚡
1 year
🚀 Exciting news from the #AI world! Researchers from UC Berkeley and Google have introduced a groundbreaking AI framework that reimagines visual question answering as modular code generation. 📖 Quick read: 🔬 Dive deeper into the paper:
2
13
48
@Marktechpost
Marktechpost AI Research News ⚡
2 months
Turing-Complete-RAG (TC-RAG): A Breakthrough Framework Enhancing Accuracy and Reliability in Medical LLMs Through Dynamic State Management and Adaptive Retrieval Researchers from Peking University, Zhongnan University of Economics and Law, University of Chinese Academy of
Tweet media one
0
16
48
@Marktechpost
Marktechpost AI Research News ⚡
2 months
iAsk Ai Outperforms ChatGPT and All Other AI Models on MMLU Pro Test iAsk Ai has quickly become a leader in AI search. iAsk Ai’s search engine is powered by iAsk Pro, their latest model that has outperformed top competitors like OpenAI’s GPT-4o, Anthropic’s Claude 3.5 Sonnet,
Tweet media one
2
9
48
@Marktechpost
Marktechpost AI Research News ⚡
1 month
LLaMA-Omni: A Novel AI Model Architecture Designed for Low-Latency and High-Quality Speech Interaction with LLMs Researchers from the University of Chinese Academy of Sciences introduced LLaMA-Omni, an innovative model architecture, that has been proposed to overcome the
Tweet media one
0
13
48
@Marktechpost
Marktechpost AI Research News ⚡
6 months
Gradformer: A Machine Learning Method that Integrates Graph Transformers (GTs) with the Intrinsic Inductive Bias by Applying an Exponential Decay Mask to the Attention Matrix Quick read: Researchers from Wuhan University China, JD Explore Academy China,
2
11
47
@Marktechpost
Marktechpost AI Research News ⚡
8 months
Meet CodeMind: A Machine Learning Framework Designed to Gauge the Code Reasoning Abilities of LLMs Quick read: A team of researchers from the University of Illinois at Urbana-Champaign introduced CodeMind, a groundbreaking framework meticulously designed
Tweet media one
0
14
47
@Marktechpost
Marktechpost AI Research News ⚡
3 months
NVIDIA Researchers Introduce Flextron: A Network Architecture and Post-Training Model Optimization Framework Supporting Flexible AI Model Deployment Researchers from NVIDIA and the University of Texas at Austin introduced FLEXTRON, a novel flexible model architecture and
0
19
47
@Marktechpost
Marktechpost AI Research News ⚡
5 months
Prometheus 2: An Open Source Language Model that Closely Mirrors Human and GPT-4 Judgements in Evaluating Other Language Models The research team from KAIST AI, LG AI Research, Carnegie Mellon University, MIT, Allen Institute for AI, and the University of Illinois Chicago
Tweet media one
2
15
46
@Marktechpost
Marktechpost AI Research News ⚡
1 year
Researchers from Yale and Google Introduce HyperAttention: An Approximate Attention Mechanism Accelerating Large Language Models for Efficient Long-Range Sequence Processing Quick Read: Paper: If you like our work, you will love
Tweet media one
1
11
41
@Marktechpost
Marktechpost AI Research News ⚡
5 months
Symbolic Chain-of-Thought ‘SymbCoT’: A Fully LLM-based Framework that Integrates Symbolic Expressions and Logic Rules with CoT Prompting Researchers from the National University of Singapore, the University of California, and the University of Auckland introduce the Symbolic
Tweet media one
0
18
45
@Marktechpost
Marktechpost AI Research News ⚡
7 months
Microsoft AI Proposes CoT-Influx: A Novel Machine Learning Approach that Pushes the Boundary of Few-Shot Chain-of-Thoughts (CoT) Learning to Improve LLM Mathematical Reasoning Quick read: A research team from Hong Kong University and Microsoft has
1
15
44
@Marktechpost
Marktechpost AI Research News ⚡
2 months
Protein Annotation-Improved Representations (PAIR): A Flexible Fine-Tuning Framework that Employs a Text Decoder to Guide the Fine-Tuning Process of the Encoder Researchers from the University of Toronto and the Vector Institute conducted a study that enhanced PLMs by
Tweet media one
0
8
45
@Marktechpost
Marktechpost AI Research News ⚡
2 months
The AI Scientist: The World’s First AI System for Automating Scientific Research and Open-Ended Discovery Researchers from Sakana AI, FLAIR, the University of Oxford, the University of British Columbia, Vector Institute, and Canada CIFAR have developed “The AI Scientist,” a
Tweet media one
0
15
46
@Marktechpost
Marktechpost AI Research News ⚡
7 months
Microsoft Introduces AutoDev: A Fully Automated Artificial Intelligence-Driven Software Development Framework Quick read: Microsoft researchers present AutoDev, which empowers AI agents to tackle a broad spectrum of software engineering tasks
0
17
45
@Marktechpost
Marktechpost AI Research News ⚡
1 year
Meet ResFields: A Novel AI Approach to Overcome the Limitations of Spatiotemporal Neural Fields in Effectively Modeling Long and Complex Temporal Signals Quick Read: Paper: Github: Project:
0
11
43
@Marktechpost
Marktechpost AI Research News ⚡
2 months
Jina AI Introduced ‘Late Chunking’: A Simple AI Approach to Embed Short Chunks by Leveraging the Power of Long-Context Embedding Models The Late Chunking method represents a significant advancement in utilizing the rich contextual information provided by 8192-length embedding
Tweet media one
1
11
44
@Marktechpost
Marktechpost AI Research News ⚡
3 months
Theory of Mind Meets LLMs: Hypothetical Minds for Advanced Multi-Agent Tasks In the ever-evolving landscape of artificial intelligence (AI), the challenge of creating systems that can effectively collaborate in dynamic environments is a significant one. Multi-agent reinforcement
Tweet media one
1
20
44
@Marktechpost
Marktechpost AI Research News ⚡
2 months
Google AI Announces Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters Researchers from UC Berkeley, and Google DeepMind propose an adaptive “compute-optimal” strategy for scaling test-time computing in LLMs. This approach selects the
Tweet media one
0
11
44
@Marktechpost
Marktechpost AI Research News ⚡
5 months
FastGen: Cutting GPU Memory Costs Without Compromising on LLM Quality Researchers from the University of Illinois Urbana-Champaign and Microsoft proposed FastGen, a highly effective technique to enhance the inference efficiency of LLMs without any loss in visible quality, using
Tweet media one
1
15
45
@Marktechpost
Marktechpost AI Research News ⚡
6 months
Meet Mini-Jamba: A 69M Parameter Scaled-Down Version of Jamba for Testing and Has the Simplest Python Code Generation Capabilities Quick read: Project Page: #ArtificialInteligence
1
15
44
@Marktechpost
Marktechpost AI Research News ⚡
1 year
Researchers at NTU Singapore Propose PointHPS: An AI Framework for Accurate Human Pose and Shape Estimation from 3D Point Clouds Quick Read: Paper: Project Page: Github: If you like
0
15
44
@Marktechpost
Marktechpost AI Research News ⚡
1 year
MIT and Harvard Researchers Propose (FAn): A Comprehensive AI System that Bridges the Gap between SOTA Computer Vision and Robotic Systems- Providing an End-to-End Solution for Segmenting, Detecting, Tracking, and Following any Object Quick Read: Paper:
Tweet media one
0
17
43
@Marktechpost
Marktechpost AI Research News ⚡
1 month
Qwen2-VL Released: The Latest Version of the Vision Language Models based on Qwen2 in the Qwen Model Familities Researchers at Alibaba have announced the release of Qwen2-VL, the latest iteration of vision language models based on Qwen2 within the Qwen model family. This new
Tweet media one
0
14
43
@Marktechpost
Marktechpost AI Research News ⚡
2 months
Hugging Face Speech-to-Speech Library: A Modular and Efficient Solution for Real-Time Voice Processing Hugging Face has just introduced a Speech-to-Speech library designed to try to overcome the integrative hardships of such models. The research team has created a modular
0
10
43
@Marktechpost
Marktechpost AI Research News ⚡
8 months
Salesforce Research Introduces AgentOhana: A Comprehensive Agent Data Collection and Training Pipeline for Large Language Model Quick read: A team of researchers from Salesforce Research, USA, has introduced AgentOhana. This comprehensive solution
Tweet media one
2
11
41
@Marktechpost
Marktechpost AI Research News ⚡
1 year
Researchers from Sony Propose BigVSAN: Revolutionizing Audio Quality with Slicing Adversarial Networks in GAN-Based Vocoders Quick Read: Paper: Github: If you like our work, you will love our newsletter:
Tweet media one
1
12
43
@Marktechpost
Marktechpost AI Research News ⚡
7 months
Researchers at Microsoft Propose AllHands: A Novel Machine Learning Framework Designed for Large-Scale Feedback Analysis Through a Natural Language Interface Quick read: Paper: @Microsoft
Tweet media one
1
11
40
@Marktechpost
Marktechpost AI Research News ⚡
1 month
SaRA: A Memory-Efficient Fine-Tuning Method for Enhancing Pre-Trained Diffusion Models Researchers from Shanghai Jiao Tong University and Youtu Lab, Tencent, propose SaRA, a fine-tuning method for pre-trained diffusion models. Inspired by model pruning, SaRA reuses “temporarily
Tweet media one
0
11
43
@Marktechpost
Marktechpost AI Research News ⚡
2 months
Parler-TTS Released: A Fully Open-Sourced Text-to-Speech Model with Advanced Speech Synthesis for Complex and Lightweight Applications Parler-TTS has emerged as a robust text-to-speech (TTS) library, offering two powerful models: Parler-TTS Large v1 and Parler-TTS Mini v1. Both
1
12
43
@Marktechpost
Marktechpost AI Research News ⚡
2 months
Tau’s Logical AI-Language Update – A Glimpse into the Future of AI Reasoning Tau is a logical AI engine that enables the creation of software and AI capable of fully mechanized reasoning, allowing software built with Tau to logically reason over formalized information, deduce
0
11
42
@Marktechpost
Marktechpost AI Research News ⚡
7 months
LlamaFactory: A Unified Machine Learning Framework that Integrates a Suite of Cutting-Edge Efficient Training Methods, Allowing Users to Customize the Fine-Tuning of 100+ LLMs Flexibly Quick read: The researchers from the School of Computer Science and
0
11
39
@Marktechpost
Marktechpost AI Research News ⚡
7 months
Meet Devika: An Open-Source AI Software Engineer that Aims to be a Competitive Alternative to Devin by Cognition AI Quick read: Github: #ArtificialInteligence
0
15
40
@Marktechpost
Marktechpost AI Research News ⚡
29 days
Google DeepMind Researchers Propose Human-Centric Alignment for Vision Models to Boost AI Generalization and Interpretation Researchers from Google DeepMind, Machine Learning Group, Technische Universität Berlin, BIFOLD, Berlin Institute for the Foundations of Learning and Data,
Tweet media one
0
16
42
@Marktechpost
Marktechpost AI Research News ⚡
6 months
How Faithful are RAG Models? This AI Paper from Stanford Evaluates the Faithfulness of RAG Models and the Impact of Data Accuracy on RAG Systems in LLMs Quick read: Stanford researchers have introduced a systematic approach to analyzing how LLMs,
1
16
41
@Marktechpost
Marktechpost AI Research News ⚡
3 months
Q-Sparse: A New Artificial Intelligence AI Approach to Enable Full Sparsity of Activations in LLMs Researchers from Microsoft and the University of Chinese Academy of Sciences have developed Q-Sparse, an efficient approach for training sparsely-activated LLMs. Q-Sparse enables
Tweet media one
1
16
41
@Marktechpost
Marktechpost AI Research News ⚡
6 months
Microsoft’s GeckOpt Optimizes Large Language Models: Enhancing Computational Efficiency with Intent-Based Tool Selection in Machine Learning Systems Quick read: The GeckOpt system, developed by Microsoft Corporation researchers, represents a cutting-edge
1
7
42
@Marktechpost
Marktechpost AI Research News ⚡
3 months
Emergence AI Proposes Agent-E: A Web Agent Achieving 73.2% Success Rate with a 20% Improvement in Autonomous Web Navigation Researchers at Emergence AI introduced Agent-E, a novel web agent designed to overcome the shortcomings of existing systems. Agent-E’s hierarchical
Tweet media one
0
10
42
@Marktechpost
Marktechpost AI Research News ⚡
6 months
Integrating Large Language Models with Graph Machine Learning: A Comprehensive Review Quick read: Paper: #ArtificialIntelligence #DataScience
Tweet media one
1
9
41
@Marktechpost
Marktechpost AI Research News ⚡
4 months
NuMind Releases NuExtract: A Lightweight Text-to-JSON LLM Specialized for the Task of Structured Extraction NuMind introduces NuExtract, a cutting-edge text-to-JSON language model that represents a significant advancement in structured data extraction from text. This model aims
Tweet media one
1
11
41
@Marktechpost
Marktechpost AI Research News ⚡
7 months
Google AI Proposes FAX: A JAX-Based Python Library for Defining Scalable Distributed and Federated Computations in the Data Center Quick read: Paper: Github: #ArtificialIntelligence #pythonprogramming
Tweet media one
2
14
40
@Marktechpost
Marktechpost AI Research News ⚡
2 months
AutoToS: An Automated Feedback System for Generating Sound and Complete Search Components in AI Planning Researchers from Cornell University and IBM Research introduced AutoToS, designed from the ground up to generate sound and complete search components without human oversight
Tweet media one
0
11
41
@Marktechpost
Marktechpost AI Research News ⚡
3 months
DeepSeek-V2-0628 Released: An Improved Open-Source Version of DeepSeek-V2 Read our take on this: Model Card: API Access: DeepSeek-V2-Chat-0628 is an enhanced iteration of the previous DeepSeek-V2-Chat
Tweet media one
0
15
40
@Marktechpost
Marktechpost AI Research News ⚡
5 months
This AI Paper by Microsoft and Tsinghua University Introduces YOCO: A Decoder-Decoder Architectures for Language Models Microsoft Research and Tsinghua University researchers have introduced a novel architecture, You Only Cache Once (YOCO), for large language models. The YOCO
Tweet media one
0
13
41
@Marktechpost
Marktechpost AI Research News ⚡
2 months
InfinityMath: A Scalable Instruction Tuning Dataset for Programmatic Mathematical Reasoning A research team from the Beijing Academy of Artificial Intelligence and China University of Mining & Technology has proposed a scalable dataset for programmatic mathematical reasoning
Tweet media one
0
14
41
@Marktechpost
Marktechpost AI Research News ⚡
7 months
Efficiency Breakthroughs in LLMs: Combining Quantization, LoRA, and Pruning for Scaled-down Inference and Pre-training Quick read: Researchers from Meta FAIR, UMD, Cisco, Zyphra, MIT, and Sequoia Capital examine a layer-pruning approach for popular
1
11
37
@Marktechpost
Marktechpost AI Research News ⚡
2 months
Arcee AI Introduces Arcee Swarm: A Groundbreaking Mixture of Agents MoA Architecture Inspired by the Cooperative Intelligence Found in Nature Itself Arcee AI, an artificial intelligence AI company focussing specially on small language models, is introducing its first-of-its-kind
0
11
40
@Marktechpost
Marktechpost AI Research News ⚡
3 months
Released DataChain: A Groundbreaking Open-Source Python Library for Large-Scale Unstructured Data Processing and Curation has announced the release of DataChain, a revolutionary open-source Python library designed to handle and
Tweet media one
0
8
40
@Marktechpost
Marktechpost AI Research News ⚡
2 months
Meta presents Self-Taught Evaluators: A New AI Approach that Aims to Improve Evaluators without Human Annotations and Outperforms Commonly Used LLM Judges Such as GPT-4 Researchers at Meta FAIR have introduced a novel approach called the “Self-Taught Evaluator.” This method
Tweet media one
0
14
40
@Marktechpost
Marktechpost AI Research News ⚡
6 months
Researchers at Apple Propose MobileCLIP: A New Family of Image-Text Models Optimized for Runtime Performance through Multi-Modal Reinforced Training Quick read: Paper: Github: @Apple
0
12
40
@Marktechpost
Marktechpost AI Research News ⚡
8 months
Alibaba Researchers Introduce Mobile-Agent: An Autonomous Multi-Modal Mobile Device Agent Quick read: Paper: Github: #ArtificialInteligence @AlibabaGroup @alibaba_cloud #LLMs
Tweet media one
1
13
38
@Marktechpost
Marktechpost AI Research News ⚡
1 year
1/4 🧵 Exciting news in the world of #AI ! Introducing Blended-NeRF, a revolutionary AI model that's like a magic brush for 3D object generation. It's a game-changer for neural radiance fields. 🎨🖌️ #ArtificialIntelligence #NeRF #3DModeling
3
15
40
@Marktechpost
Marktechpost AI Research News ⚡
6 months
Tencent AI Lab Developed AlphaLLM: A Novel Machine Learning Framework for Self-Improving Language Models Quick read: Researchers from Tencent AI lab have introduced ALPHALLM, a novel framework that integrates MCTS with LLMs to promote self-improvement
1
13
39
@Marktechpost
Marktechpost AI Research News ⚡
5 months
Data Complexity and Scaling Laws in Neural Language Models Quick read: Paper: GitHub: @ReworkdAI @khoomeik
Tweet media one
1
8
39
@Marktechpost
Marktechpost AI Research News ⚡
4 months
GraphReader: A Graph-based AI Agent System Designed to Handle Long Texts by Structuring them into a Graph and Employing an Agent to Explore this Graph Autonomously Quick read: Paper:
Tweet media one
0
18
39
@Marktechpost
Marktechpost AI Research News ⚡
6 months
AURORA-M: A 15B Parameter Multilingual Open-Source AI Model Trained in English, Finnish, Hindi, Japanese, Vietnamese, and Code Quick read: Paper: HF Page:
1
17
38