Grok Papers

32 papers

Expertise level:

• Simplified overview for general audiences

★ Hall of Fame

DeepSeek-V3 Technical Report

DeepSeek-AI·Dec 2024·420 citationsReasoning & Alignment

The Efficiency Manifesto. Introduced Multi-Head Latent Attention (MLA) and DeepSeekMoE, proving GPT-4 class models can be trained for $5.5M.

Beginner PPTX Expert PPTX Source PDF arXiv

★ Hall of Fame

DeepSeek-V3 Technical Report

DeepSeek-AI·Dec 2024·420 citations

The Efficiency Manifesto. Introduced Multi-Head Latent Attention (MLA) and DeepSeekMoE, proving GPT-4 class models can be trained for $5.5M.

Beginner PPTX Expert PPTX Source

★ Hall of Fame

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Gu, A., Dao, T.·Dec 2023·1.5k citationsSystems & Scaling

The Transformer Challenger. Proposed a modern State Space Model (SSM) architecture that offers linear scaling, influencing new "hybrid" architectures.

Beginner PPTX Expert PPTX Source PDF arXiv

★ Hall of Fame

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Gu, A., Dao, T.·Dec 2023·1.5k citations

The Transformer Challenger. Proposed a modern State Space Model (SSM) architecture that offers linear scaling, influencing new "hybrid" architectures.

Beginner PPTX Expert PPTX Source

★ Hall of Fame

Direct Preference Optimization (DPO)

Rafailov, R., Sharma, A., Mitchell, E. et al.·May 2023·2.8k citationsReasoning & Alignment

Killed PPO. Simplified alignment by mathematically showing you can optimize for human preferences directly without training a separate Reward Model.

Beginner PPTX Expert PPTX Source PDF arXiv

★ Hall of Fame

Direct Preference Optimization (DPO)

Rafailov, R., Sharma, A., Mitchell, E. et al.·May 2023·2.8k citations

Killed PPO. Simplified alignment by mathematically showing you can optimize for human preferences directly without training a separate Reward Model.

Beginner PPTX Expert PPTX Source

★ Hall of Fame

QLoRA: Efficient Finetuning of Quantized LLMs

Dettmers, T., Pagnoni, A., Holtzman, A. et al.·May 2023·2.2k citationsSystems & Scaling

The Democratizer. Combined 4-bit quantization with LoRA, allowing anyone to finetune a 65B parameter model on a single consumer GPU.

Beginner PPTX Expert PPTX Source PDF arXiv

★ Hall of Fame

QLoRA: Efficient Finetuning of Quantized LLMs

Dettmers, T., Pagnoni, A., Holtzman, A. et al.·May 2023·2.2k citations

The Democratizer. Combined 4-bit quantization with LoRA, allowing anyone to finetune a 65B parameter model on a single consumer GPU.

Beginner PPTX Expert PPTX Source

★ Hall of Fame

Voyager: An Open-Ended Embodied Agent with Large Language Models

Wang, G., Xie, Y., Jiang, Y. et al.·May 2023·950 citationsSystems & Scaling

The Agent Blueprint. One of the first papers to successfully use an LLM to write code, execute it in Minecraft, fail, and self-correct via a feedback loop.

Beginner PPTX Expert PPTX Source PDF arXiv

★ Hall of Fame

Voyager: An Open-Ended Embodied Agent with Large Language Models

Wang, G., Xie, Y., Jiang, Y. et al.·May 2023·950 citations

The Agent Blueprint. One of the first papers to successfully use an LLM to write code, execute it in Minecraft, fail, and self-correct via a feedback loop.

Beginner PPTX Expert PPTX Source

★ Hall of Fame

Segment Anything (SAM)

Kirillov, A., Mintun, E., Ravi, N. et al.·Apr 2023·4.2k citationsComputer Vision

Meta's foundation model for image segmentation that generalizes to zero-shot objects.

Beginner PPTX Expert PPTX Source PDF arXiv

★ Hall of Fame

Segment Anything (SAM)

Kirillov, A., Mintun, E., Ravi, N. et al.·Apr 2023·4.2k citations

Meta's foundation model for image segmentation that generalizes to zero-shot objects.

Beginner PPTX Expert PPTX Source

★ Hall of Fame

LLaMA: Open and Efficient Foundation Language Models

Touvron, H., Lavril, T., Izacard, G. et al.·Feb 2023·8.5k citationsLLMs & Transformers

Meta's release that kickstarted the open-source LLM race by proving smaller, better-trained models can rival giants.

Beginner PPTX Expert PPTX Source PDF arXiv

★ Hall of Fame

LLaMA: Open and Efficient Foundation Language Models

Touvron, H., Lavril, T., Izacard, G. et al.·Feb 2023·8.5k citations

Meta's release that kickstarted the open-source LLM race by proving smaller, better-trained models can rival giants.

Beginner PPTX Expert PPTX Source

★ Hall of Fame

Adding Conditional Control to Text-to-Image Diffusion Models (ControlNet)

Zhang, L., Rao, A., Agrawala, M.·Feb 2023·3.8k citationsGenerative AI

Allowed precise structural control (edges, pose, depth) over diffusion generation.

Beginner PPTX Expert PPTX Source PDF arXiv

★ Hall of Fame

Adding Conditional Control to Text-to-Image Diffusion Models (ControlNet)

Zhang, L., Rao, A., Agrawala, M.·Feb 2023·3.8k citations

Allowed precise structural control (edges, pose, depth) over diffusion generation.

Beginner PPTX Expert PPTX Source