alphaXiv (@askalphaxiv) / X

alphaXiv

2,258 posts

alphaXiv

@askalphaxiv

High fidelity research

Joined November 2023

Pinned
alphaXiv
@askalphaxiv
May 12
Reinforcing Recursive Language Models Can a 4B model learn to recursively call itself to answer hard long-context questions? We RL fine-tuned a small model to behave as a native RLM. On evidence selection across scientific papers, our 4B RLM matches Sonnet 4.6 in quality
68K
alphaXiv
@askalphaxiv
Apr 8, 2025
Introducing Deep Research for arXiv Ask questions like 'What are the latest breakthroughs in RL fine-tuning?' and get comprehensive lit reviews with trending papers automatically included Turn hours of literature searches into seconds with AI-powered research context ⚡
00:00
373K
alphaXiv
@askalphaxiv
Oct 15, 2025
Introducing NotebookLM for arXiv papers 🚀 Transform dense AI research into an engaging conversation With context across thousands of related papers, it captures motivations, draws connections to SOTA, and explains key insights like a professor who's read the entire field
00:00
216K
alphaXiv
@askalphaxiv
Oct 21, 2025
We used DeepSeek OCR to extract every dataset from tables/charts across 500k+ AI arXiv papers for $1000 🚀 See which benchmarks are trending and discover datasets you didn't know existed Doing the same task with Mistral OCR would've cost $7500 👀
00:00
285K
alphaXiv
@askalphaxiv
Jun 13, 2025
Claude is now being listed as an author on arXiv papers A response paper to Apple's "Illusion of Thinking" work just dropped with Claude Opus as first author, critiquing their experimental design and arguing the reasoning collapse was actually just token limit constraints.
367K
alphaXiv
@askalphaxiv
Jul 23, 2025
Google has shared the system prompt that got Gemini 2.5 Pro IMO 2025 Gold Medal 🏅 paper now #1 trending on alphaXiv 📈
Readers added context they thought people might want to knowReaders added context
This is not the prompt used by GDM. This was a prompted used by UCLA professors who used a different model, and published it here: arxiv.org/abs/2507.15855 There is no official GDM paper yet.
Context is written by people who use X, and appears when rated helpful by others. Find out more.
278K
alphaXiv
@askalphaxiv
Mar 12, 2025
We used Mistral OCR with Claude 3.7 to create blog-style overviews for arXiv papers Generate beautiful research blogs with figures, key insights, and clear explanations from the paper with just one click Understand papers in minutes - not hours
00:00
154K
alphaXiv
@askalphaxiv
Feb 15, 2025
1997: Deep Blue defeats Kasparov at chess 2016: AlphaGo masters the game of Go 2025: Stanford researchers crack Among Us Trending on alphaXiv 📈 Remarkable new work trains LLMs to master strategic social deduction through multi-agent RL, doubling win rates over standard RL.
00:00
210K
alphaXiv
@askalphaxiv
Jun 17, 2025
Introducing your arXiv Research Agent A personal research assistant with access to arXiv + bioRxiv + medRxiv + Semantic Scholar. Upload drafts, conduct literature reviews, get insights across millions of papers MCP support coming soon 🚀
00:00
128K
alphaXiv
@askalphaxiv
Jan 28, 2025
We used DeepSeek-V3 to classify every AI paper on arXiv by topic (agents, VLMs, etc) 🚀 Now you can instantly filter to see what's trending in each area 🚨
00:00
142K
alphaXiv
@askalphaxiv
Sep 15, 2025
First paper published by Meta Superintelligence Labs! In this paper, they make RAG faster by swapping most retrieved tokens for precomputed & reusable chunk embeddings, called REFRAG This method improves its speed by 30x and fitting 16x longer contexts without accuracy loss
178K
alphaXiv
@askalphaxiv
Feb 6, 2025
We used Gemini 2 Flash to build Cursor for arXiv papers Highlight any section of a paper to ask questions and “@” other papers to quickly add to context and compare results, benchmarks, etc.
00:00
145K
alphaXiv
@askalphaxiv
Oct 22, 2025
DeepSeek OCR processes PDFs at 1/10th the cost of traditional OCR tools We're now hosting it on alphaXiv's API! Extract figures, complex diagrams, and text from any PDF 🚀
00:00
81K
alphaXiv
@askalphaxiv
Jul 11, 2025
are we finally getting rid of tokenization? "Dynamic Chunking for End-to-End Hierarchical Sequence Modeling" with a hierarchical “H-Net” that learns content-aware byte-level boundaries end-to-end, they eliminated fixed tokenization and surpasses BPE-based Transformers
75K