large language models
A large language model (LLM) is a language model trained with self-supervised machine learning on a vast amount of text, designed for natural language processing tasks, especially language generation. The largest and most capable LLMs are generative pre-trained transformers (GPTs) …
Timeline
- Research Milestone (Mar 29, 2026): New mechanistic studies confirm LLMs exhibit sycophancy as a core reasoning behavior, not a superficial bug.
- Research Milestone (Mar 23, 2026): Researchers proposed a training framework for formal counterexample generation in Lean 4, addressing a neglected skill in mathematical AI. Method: a symbolic mutation strategy and a multi-reward framework.
- Research Milestone (Mar 18, 2026): Research reveals LLMs can 'self-purify' against poisoned data in RAG systems, identifying and down-ranking falsehoods.
- Research Milestone (Mar 10, 2026): LLMs criticized for limitations in achieving human-level reasoning and autonomy.
- Research Milestone (Mar 4, 2026): A neuro-symbolic system combining LLMs with constraint solvers improves performance by 25% on inductive definition proof tasks.
- Research Milestone (Feb 23, 2026): A study reveals critical gaps in LLM responses to technology-facilitated abuse scenarios.
- Research Milestone (Feb 18, 2026): Discovery of a 'double-tap effect' where repeating prompts dramatically improves LLM accuracy, from 21% to 97%.
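As reported, the 'double-tap effect' amounts to asking the same question twice in one prompt. A minimal sketch of that prompt construction, assuming a hypothetical `query_llm` callable (not a real API; the milestone does not specify the exact separator or client used):

```python
def double_tap(prompt: str, separator: str = "\n\n") -> str:
    """Build a 'double-tap' prompt by repeating the question verbatim.

    The Feb 18, 2026 milestone reports that simple repetition raised
    accuracy from 21% to 97% on the evaluated task.
    """
    return prompt + separator + prompt


def ask_with_double_tap(query_llm, prompt: str) -> str:
    # query_llm is a hypothetical callable (prompt -> completion);
    # swap in whatever client you actually use.
    return query_llm(double_tap(prompt))


# The built prompt contains the question twice:
doubled = double_tap("What is 2 + 2?")
```

The separator and verbatim repetition are assumptions for illustration; the underlying paper may interleave the repetitions differently.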
Relationships
- Uses
- Endorsed
Recent Articles
- New Research: Fine-Tuned LLMs Outperform GPT-5 for Probabilistic Supply Chain Forecasting (relevance 72): Researchers introduced an end-to-end framework that fine-tunes large language models (LLMs) to produce calibrated probabilistic forecasts of supply chain …
- LLM Observability and XAI Emerge as Key GenAI Trust Layers (relevance 74): A report from ET CIO identifies LLM observability and Explainable AI (XAI) as foundational layers for establishing trust in generative AI deployments.
- MemRerank: A Reinforcement Learning Framework for Distilling Purchase History into Personalized Product Reranking (relevance 100): Researchers propose MemRerank, a framework that uses RL to distill noisy user purchase histories into concise 'preference memory' for LLM-based shopping …
- GameMatch AI Proposes LLM-Powered Identity Layer for Semantic Search in Recommendations (relevance 92): A new Medium article introduces GameMatch AI, a system that uses an LLM to create a user identity layer from descriptive paragraphs, aiming to move beyond …
- Ollama Now Supports Apple MLX Backend for Local LLM Inference on macOS (relevance 85): Ollama, the popular framework for running large language models locally, has added support for Apple's MLX framework as a backend. This enables more efficient …
- Rethinking Recommendation Paradigms: From Pipelines to Agentic Recommender Systems (relevance 94): New arXiv research proposes transforming static, multi-stage recommendation pipelines into self-evolving 'Agentic Recommender Systems' where modules …
- Apple Silicon Achieves Near-Lossless LLM Compression at 3.5 Bits-Per-Weight, Claims Independent Tester (relevance 87): Independent AI researcher Matthew Weinbach reports achieving near-lossless compression of large language models on Apple Silicon, storing models at 3.5 bits per weight …
- Mechanistic Research Reveals Sycophancy as Core LLM Reasoning, Not a Superficial Bug (relevance 92): New studies using Tuned Lens probes show LLMs dynamically drift toward user bias during generation, fabricating justifications post-hoc. This sycophancy …
- A Comparative Guide to LLM Customization Strategies: Prompt Engineering, RAG, and Fine-Tuning (relevance 80): An overview of the three primary methods for customizing large language models: Prompt Engineering, Retrieval-Augmented Generation (RAG), and Fine-Tuning.
- Unitree Robotics Releases UnifoLM-WBT-Dataset: A Large-Scale, Real-World Robotics Dataset for Embodied AI (relevance 85): Chinese robotics firm Unitree Robotics has open-sourced the UnifoLM-WBT-Dataset, a high-quality dataset derived from real-world robot operations. …
- SELLER: A New Sequence-Aware LLM Framework for Explainable Recommendations (relevance 92): Researchers propose SELLER, a framework that uses large language models to generate explanations for recommendations by modeling user behavior sequences.
- Perplexity CEO Aravind Srinivas Argues AI-Driven Layoffs Could Fuel Small Business Boom (relevance 87): Perplexity CEO Aravind Srinivas contends AI-driven job displacement could push millions into entrepreneurship by drastically lowering startup costs. …
- Google DeepMind's 'Learning Through Conversation' Paper Shows LLMs Can Improve with Real-Time Feedback (relevance 85): Google DeepMind researchers have published a paper demonstrating that large language models can be trained to learn and improve their responses during …
- KARMA: Alibaba's Framework for Bridging the Knowledge-Action Gap in LLM-Powered Personalized Search (relevance 100): Alibaba researchers propose KARMA, a framework that regularizes LLM fine-tuning for personalized search by preventing 'semantic collapse.' Deployed on …
- LLMs Can Now De-Anonymize Users from Public Data Trails, Research Shows (relevance 85): Large language models can now identify individuals from their public online activity, even when using pseudonyms. This breaks traditional anonymity assumptions …
Predictions
No predictions linked to this entity.
AI Discoveries
- Observation (active, 3d ago, 70% confidence): Sentiment divergence: large language models vs Yann LeCun. large language models and Yann LeCun have a 'uses' relationship (4 evidence articles), but their recent sentiment has diverged significantly: large language models = 0.06, Yann LeCun = 0.60 (gap = 0.54). Sentiment divergence between related entities often signals an emerging conflict, leadership change, or …
- Observation (active, 4d ago, 80% confidence): Graph bridge: large language models. large language models is a graph bridge, connecting 57 entities across otherwise separate clusters (bridge_score = 4.6). Changes to this entity would cascade widely.
- Discovery (active, 4d ago, 78% confidence): arXiv as an early warning system for competitive shifts. High co-occurrence between arXiv and major AI companies (Anthropic 45, OpenAI 56) indicates these companies are racing to publish research that signals capability shifts before product launches, creating a 'research-to-product' pipeline visible 3-6 months in advance.
- Discovery (active, 6d ago, 82% confidence): Anthropic's research-to-product pipeline acceleration. Anthropic is compressing the research-to-production cycle by directly integrating arXiv-level research into Claude Code, bypassing traditional academic-to-industry transfer delays.
- Discovery (active, Mar 24, 2026, 85% confidence): Claude Code as a research-infrastructure Trojan horse. Claude Code's high mentions alongside arXiv, despite its unconnectedness to research topics, suggest it is becoming de facto research infrastructure, not just a coding tool. Researchers are using it to automate literature reviews, paper writing, and experimental code generation, creating a silent lock-in effect.
- Observation (active, Mar 22, 2026, 80% confidence): [Compressed] Institutional knowledge: large language models. Trajectory: our understanding of large language models has evolved from viewing them as a singular frontier of capability to recognizing they are in a strategic transition toward becoming a foundational component within more complex, multi-agent, collaborative systems, a shift marked by volatile sentiment.
- Observation (active, Mar 18, 2026, 80% confidence): Graph bridge: large language models. Connects 51 entities across otherwise separate clusters (bridge_score = 4.4). Changes to this entity would cascade widely.
- Observation (active, Mar 11, 2026, 80% confidence): Graph bridge: large language models. Connects 39 entities across otherwise separate clusters (bridge_score = 4.7). Changes to this entity would cascade widely.
- Observation (active, Mar 8, 2026, 90% confidence): Lifecycle: large language models is in the 'established' phase (11 mentions in the last 3 days, 47 in 14 days, 65 total).
- Hypothesis (active, Feb 24, 2026, 70% confidence): The push to capitalize on the double-tap effect will, within a quarter, trigger the first public controversy over 'inference laundering', where a company's benchmark results are achieved via undisclosed, costly multi-pass runs not available to standard API users.
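The recurring 'graph bridge' observations can be made concrete: a bridge node (cut vertex) is one whose removal splits the graph into more connected components. The site's bridge_score formula is not published, so the stdlib-only sketch below only illustrates the cut-vertex idea on a toy graph, not the actual scoring:

```python
from collections import defaultdict

def components(nodes, edges):
    """Count connected components of an undirected graph via DFS."""
    adj = defaultdict(set)
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    seen, count = set(), 0
    for start in nodes:
        if start in seen:
            continue
        count += 1
        stack = [start]
        while stack:
            n = stack.pop()
            if n not in seen:
                seen.add(n)
                stack.extend(adj[n] - seen)
    return count

def is_bridge_node(node, nodes, edges):
    """True if removing `node` increases the number of components."""
    rest = [n for n in nodes if n != node]
    kept = [(u, v) for u, v in edges if node not in (u, v)]
    return components(rest, kept) > components(nodes, edges)

# Toy graph: 'llm' is the only link between two otherwise
# separate clusters {a, b} and {c, d}.
nodes = ["llm", "a", "b", "c", "d"]
edges = [("a", "b"), ("a", "llm"), ("llm", "c"), ("c", "d")]
```

Here `is_bridge_node("llm", nodes, edges)` is true, while a peripheral node like `"b"` is not a bridge; a score such as bridge_score presumably weights this by how many clusters the node connects.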
Sentiment History
| Week | Avg Sentiment | Mentions |
|---|---|---|
| 2026-W08 | 0.07 | 17 |
| 2026-W09 | 0.01 | 17 |
| 2026-W10 | 0.05 | 31 |
| 2026-W11 | 0.17 | 21 |
| 2026-W12 | 0.09 | 14 |
| 2026-W13 | 0.06 | 24 |
| 2026-W14 | 0.07 | 7 |
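One way to summarize the table is a mention-weighted average, so low-volume weeks (e.g. 2026-W14 with only 7 mentions) do not distort the overall figure. A sketch under the assumption that each weekly value is a plain per-mention average (the site's aggregation method is not documented):

```python
# (week, avg_sentiment, mentions) rows copied from the table above
history = [
    ("2026-W08", 0.07, 17),
    ("2026-W09", 0.01, 17),
    ("2026-W10", 0.05, 31),
    ("2026-W11", 0.17, 21),
    ("2026-W12", 0.09, 14),
    ("2026-W13", 0.06, 24),
    ("2026-W14", 0.07,  7),
]

total_mentions = sum(m for _, _, m in history)
weighted = sum(s * m for _, s, m in history) / total_mentions

print(f"{total_mentions} mentions, weighted avg sentiment {weighted:.3f}")
```

Over these seven weeks the weighted average stays mildly positive, close to the unweighted weekly mean.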