AI PulseAI Market Pulse

Market dataTradingView

Intelligence Terminal

Search AI News

Source-backed intelligence across model releases, research, policy, tools, funding, and companies.

Loading latest AI news...

Intelligence Terminal

Search AI News

Source-backed intelligence across model releases, research, policy, tools, funding, and companies.

13 signals found for "Fine-tuning"

modelsarXiv (Zilin Xiao)12d ago

45SIG

90CONF

Learning to Reason by Analogy via Retrieval-Augmented Reinforcement Fine-Tuning

Retrieval-augmented generation (RAG) has become a standard mechanism for grounding language models in external knowle...

Qwen

Models

Qwen

Read Brief Source

modelsHugging Face (Sanket Badhe)22d ago

65SIG

80CONF

Prompt-Level Distillation: A Non-Parametric Alternative to Model Fine-Tuning for Efficient Reasoning

Advanced reasoning typically requires Chain-of-Thought prompting, which is accurate but incurs prohibitive latency an...

Mistral

Models

Mistral

Read Brief Source

modelsNVIDIA Developer Blog8d ago

65SIG

80CONF

Fine-Tuning Biological Foundation Models with LoRA Using NVIDIA BioNeMo Recipes

Foundation models are reshaping computational biology. Pretrained on massive corpora of protein or genomic sequences,...

Nvidia

Models

Nvidia

Read Brief Source

researcharXiv (Haotao Xie)13d ago

45SIG

90CONF

System Report for CCL25-Eval Task 5: New Dataset and LoRA-Fine-Tuned Qwen2.5

Recently, large language models (LLMs) have achieved promising progress in the fields of classical Chinese translatio...

Research

Read Brief Source

modelsGitHub (vachhanidhyey)10d ago

55SIG

33CONF

vachhanidhyey/llm_intercept released an update

🚀 Intercept and log LLM calls to build fine-tuning datasets for compact models, enabling cost-effective and efficien...

Models

LLM

Read Brief Source

modelsHugging Face (Yalun Dai)6d ago

65SIG

80CONF

S-Agent: Spatial Tool-Use Elicits Reasoning for Spatial Intelligence

Real-world spatial intelligence requires reasoning over a continuous and evolving 3D world, yet existing VLMs and too...

Qwen

Models

Qwen

OpenAI

Read Brief Source

researchHugging Face (Joshua Ong Jun Leang)14d ago

65SIG

80CONF

Pythagoras-Prover: Advancing Efficient Formal Proving via Augmented Lean Formalisation

Modern Lean theorem provers achieve strong performance only with substantial training and inference compute, driven i...

DeepSeek

Research

DeepSeek

Read Brief Source

policyHugging Face (Sen Xu)9d ago

60SIG

100CONF

VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models

This technical report introduces VibeThinker-3B, a compact dense model with 3B parameters developed to investigate ho...

DeepSeek

Policy

DeepSeek

Google DeepMind

Read Brief Source

safetyAxios (Jim VandeHei)6d ago

50.5SIG

56.5CONF

Jim VandeHei: Writing with AI

Few AI use cases elicit more outrage than writing: Using AI makes writing duller ... dumber ... robotic. It kills thi...

Safety

OpenAI

Anthropic

Read Brief Source

modelsHugging Face (Filip Sondej)9d ago

59SIG

80CONF

RepSelect: Robust LLM Unlearning via Representation Selectivity

Making large language models (LLMs) deeply forget specific knowledge and values without sacrificing general capabilit...

DeepSeek

Models

DeepSeek

Meta AI

Read Brief Source

modelsarXiv (Despina Christou)2d ago

70SIG

100CONF

Sub-Billion, Super-Frontier: Small Language Models Rival Zero-Shot Frontier LLMs on General and Literary Relation Extraction

Large language models (LLMs) achieve strong relation extraction (RE), but their computational demands and reliance on...

Qwen

Models

Qwen

OpenAI

Read Brief Source

modelsarXiv (Miso Choi)9d ago

70SIG

100CONF

The Truth Stays in the Family: Enhancing Contextual Grounding via Inherited Truthful Heads in Model Lineages

Recent advances in large language models (LLMs) have produced many specialized multimodal LLMs (MLLMs) that share com...

Mistral

Models

Mistral

Meta AI

Read Brief Source

modelsIEEE Spectrum AI (Dina Genkina)13d ago

58.5SIG

42.5CONF

Timing Trick Cuts Energy Used in LLM Training by Up to 14 Percent

OpenAI’s fourth large language model (LLM), GPT-4, took an estimated 50 gigawatt-hours to train, or the equivalent of...

OpenAI

Models

OpenAI

Nvidia

Read Brief Source

13 signals found for "Fine-tuning"

modelsarXiv (Zilin Xiao)12d ago

45SIG

90CONF

Learning to Reason by Analogy via Retrieval-Augmented Reinforcement Fine-Tuning

Retrieval-augmented generation (RAG) has become a standard mechanism for grounding language models in external knowle...

Qwen

Models

Qwen

Read Brief Source

modelsHugging Face (Sanket Badhe)22d ago

65SIG

80CONF

Prompt-Level Distillation: A Non-Parametric Alternative to Model Fine-Tuning for Efficient Reasoning

Advanced reasoning typically requires Chain-of-Thought prompting, which is accurate but incurs prohibitive latency an...

Mistral

Models

Mistral

Read Brief Source

modelsNVIDIA Developer Blog8d ago

65SIG

80CONF

Fine-Tuning Biological Foundation Models with LoRA Using NVIDIA BioNeMo Recipes

Foundation models are reshaping computational biology. Pretrained on massive corpora of protein or genomic sequences,...

Nvidia

Models

Nvidia

Read Brief Source

researcharXiv (Haotao Xie)13d ago

45SIG

90CONF

System Report for CCL25-Eval Task 5: New Dataset and LoRA-Fine-Tuned Qwen2.5

Recently, large language models (LLMs) have achieved promising progress in the fields of classical Chinese translatio...

Research

Read Brief Source

modelsGitHub (vachhanidhyey)10d ago

55SIG

33CONF

vachhanidhyey/llm_intercept released an update

🚀 Intercept and log LLM calls to build fine-tuning datasets for compact models, enabling cost-effective and efficien...

Models

LLM

Read Brief Source

modelsHugging Face (Yalun Dai)6d ago

65SIG

80CONF

S-Agent: Spatial Tool-Use Elicits Reasoning for Spatial Intelligence

Real-world spatial intelligence requires reasoning over a continuous and evolving 3D world, yet existing VLMs and too...

Qwen

Models

Qwen

OpenAI

Read Brief Source

researchHugging Face (Joshua Ong Jun Leang)14d ago

65SIG

80CONF

Pythagoras-Prover: Advancing Efficient Formal Proving via Augmented Lean Formalisation

Modern Lean theorem provers achieve strong performance only with substantial training and inference compute, driven i...

DeepSeek

Research

DeepSeek

Read Brief Source

policyHugging Face (Sen Xu)9d ago

60SIG

100CONF

VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models

This technical report introduces VibeThinker-3B, a compact dense model with 3B parameters developed to investigate ho...

DeepSeek

Policy

DeepSeek

Google DeepMind

Read Brief Source

safetyAxios (Jim VandeHei)6d ago

50.5SIG

56.5CONF

Jim VandeHei: Writing with AI

Few AI use cases elicit more outrage than writing: Using AI makes writing duller ... dumber ... robotic. It kills thi...

Safety

OpenAI

Anthropic

Read Brief Source

modelsHugging Face (Filip Sondej)9d ago

59SIG

80CONF

RepSelect: Robust LLM Unlearning via Representation Selectivity

Making large language models (LLMs) deeply forget specific knowledge and values without sacrificing general capabilit...

DeepSeek

Models

DeepSeek

Meta AI

Read Brief Source

modelsarXiv (Despina Christou)2d ago

70SIG

100CONF

Sub-Billion, Super-Frontier: Small Language Models Rival Zero-Shot Frontier LLMs on General and Literary Relation Extraction

Large language models (LLMs) achieve strong relation extraction (RE), but their computational demands and reliance on...

Qwen

Models

Qwen

OpenAI

Read Brief Source

modelsarXiv (Miso Choi)9d ago

70SIG

100CONF

The Truth Stays in the Family: Enhancing Contextual Grounding via Inherited Truthful Heads in Model Lineages

Recent advances in large language models (LLMs) have produced many specialized multimodal LLMs (MLLMs) that share com...

Mistral

Models

Mistral

Meta AI

Read Brief Source

modelsIEEE Spectrum AI (Dina Genkina)13d ago

58.5SIG

42.5CONF

Timing Trick Cuts Energy Used in LLM Training by Up to 14 Percent

OpenAI’s fourth large language model (LLM), GPT-4, took an estimated 50 gigawatt-hours to train, or the equivalent of...

OpenAI

Models

OpenAI

Nvidia

Read Brief Source