DeepSeek - Myaiblog

DeepSeek

GRPO (Group Relative Policy Optimization) Study Notes

GRPO (Group Relative Policy Optimization) Study Notes

We introduce Group Relative Policy Optimization (GRPO), a variant of Proximal Policy Optimization (PPO)

2025-03-04

DeepSeek #OpenSourceWeek - Five Consecutive Releases

DeepSeek #OpenSourceWeek - Five Consecutive Releases

We're a tiny team @deepseek_ai exploring AGI.

2025-02-28

DeepSeek-R1 - Andrej Karpathy in-depth explanation of LLM (Part 9)

DeepSeek-R1 - Andrej Karpathy in-depth explanation of LLM (Part 9)

DeepSeek R1

2025-02-24

How Different Large Models Like DeepSeek R1/ChatGPT o3/Grok3 Think

How Different Large Models Like DeepSeek R1/ChatGPT o3/Grok3 Think

LLM Think

2025-02-19

Andrej Karpathy in-depth explanation of large language model (LLM) technology (Part 1) - [Pretraining and Inference]

Andrej Karpathy in-depth explanation of large language model (LLM) technology (Part 1) - [Pretraining and Inference]

- introduction - pretraining data (internet) - tokenization - neural network I/O - neural network internals - inference

2025-02-11

DeepSeek Janus Series: Unified Multimodal Understanding and Generation Models

DeepSeek Janus Series: Unified Multimodal Understanding and Generation Models

Janus-Series: Unified Multimodal Understanding and Generation Models

2025-01-28

Comparison of the reasoning processes between ChatGPT o1 pro and DeepSeek R1

Comparison of the reasoning processes between ChatGPT o1 pro and DeepSeek R1

DeepSeek R1 Vs ChatGPT 01 (My Experience)

2025-01-27

DeepSeek R1: X.com User Reviews

DeepSeek R1: X.com User Reviews

Deepseek-r1 is open source and on par with o1 preview - @bindureddy

2025-01-26

Paper of DeepSeek-R1: Exploration and Breakthrough of the New Generation Inference Model

Paper of DeepSeek-R1: Exploration and Breakthrough of the New Generation Inference Model

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

2025-01-25