Chinese’s DeepSeek-Coder-V2 – Breaking the Barrier of Closed-Source Models in open source Code Intelligence

June 18, 2024

pIXELsHAM.com

An open-source Mixture-of-Experts (MoE) code language model that achieves performance comparable to GPT4-Turbo in code-specific tasks. Specifically, DeepSeek-Coder-V2 is further pre-trained from an intermediate checkpoint of DeepSeek-V2 with additional 6 trillion tokens. Through this continued pre-training, DeepSeek-Coder-V2 substantially enhances the coding and mathematical reasoning capabilities of DeepSeek-V2, while maintaining comparable performance in general language tasks. Compared to DeepSeek-Coder-33B, DeepSeek-Coder-V2 demonstrates significant advancements in various aspects of code-related tasks, as well as reasoning and general capabilities. Additionally, DeepSeek-Coder-V2 expands its support for programming languages from 86 to 338, while extending the context length from 16K to 128K.

https://github.com/deepseek-ai/DeepSeek-Coder-V2

https://venturebeat.com/ai/chinas-deepseek-coder-becomes-first-open-source-coding-model-to-beat-gpt-4-turbo/

COLLECTIONS

| Featured AI
| Design And Composition
| Explore posts

POPULAR SEARCHES

FEATURED POSTS

Social Links

DISCLAIMER – Links and images on this website may be protected by the respective owners’ copyright. All data submitted by users through this site shall be treated as freely available to share.

Chinese’s DeepSeek-Coder-V2 – Breaking the Barrier of Closed-Source Models in open source Code Intelligence

Glossary of Lighting Terms – cheat sheet

The CG Career YouTube channel is live!

Kling 1.6 and competitors – advanced tests and comparisons

Eddie Yoon – There’s a big misconception about AI creative

Want to build a start up company that lasts? Think three-layer cake

Game Development tips

Types of Film Lights and their efficiency – CRI, Color Temperature and Luminous Efficacy

Tencent Hunyuan3D 2.1 goes Open Source and adds MV (Multi-view) and MV Mini