Newsletter
Get notified when new AI tools are added
Join the community.
Chinese ChatGPT competitor for chat, coding, and analysis
DeepSeek V3 is a large language model developed by the Chinese company DeepSeek. It’s used for text generation and analysis, translation, and writing code, with an interface similar to ChatGPT or Claude.
DeepSeek reports a 671B-parameter model trained on 14.8T tokens, trained for about two months on an Nvidia H800 cluster, with an estimated cost of about $5.5M. On Codeforces, DeepSeek V3 scored higher than Llama 3.1 and GPT-4o.
Independent testing highlights that results vary by evaluation method. For example, Alessandro Quadron reported different scores for OpenAI o1 depending on the framework, and found DeepSeek V3 roughly comparable to o1 in programming tasks, while Anthropic Sonnet 3.5 scored higher in that setup.
0 comments
No comments yet
Start the discussion and your comment will appear here right away.