DeepSeek - R1 알고리즘 관련 글

Hwiyong Jo 2025. 2. 8. 16:45

잘 정리된 블로그가 있어 기록상 남겨놓음

https://yoonschallenge.tistory.com/971

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning - 논문 리뷰

https://arxiv.org/abs/2501.12948 DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement LearningWe introduce our first-generation reasoning models, DeepSeek-R1-Zero and DeepSeek-R1. DeepSeek-R1-Zero, a model trained via large-scale rein

yoonschallenge.tistory.com

많이 쉬운 가장 기초중의 기초 내용

https://www.youtube.com/watch?v=KTonvXhsxpc