1 link tagged with all of: llm + reinforcement-learning + benchmarks + deepseek
Click any tag below to further narrow down your results
Links
This article discusses advancements in the Deepseek model, highlighting reduced attention complexity and innovations in reinforcement learning training. It also critiques the assumptions surrounding open-source large language models and questions the benchmarks used to evaluate their performance.