# Machine Learning/글 공부
DeepSeek - R1 알고리즘 관련 글
Hwiyong Jo
2025. 2. 8. 16:45
잘 정리된 블로그가 있어 기록상 남겨놓음
https://yoonschallenge.tistory.com/971
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning - 논문 리뷰
https://arxiv.org/abs/2501.12948 DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement LearningWe introduce our first-generation reasoning models, DeepSeek-R1-Zero and DeepSeek-R1. DeepSeek-R1-Zero, a model trained via large-scale rein
yoonschallenge.tistory.com
많이 쉬운 가장 기초중의 기초 내용
https://www.youtube.com/watch?v=KTonvXhsxpc