DeepSeek has once again made a move just as a holiday arrives. Moments ago, with no advance notice, DeepSeek open-sourced its latest model dedicated to mathematical theorem proving, DeepSeek-Prover-V2-671B, on Hugging Face.

(Source: Hugging Face)

The new model is not a general-purpose chatbot. It targets a highly specialized field: the formal proof of mathematical theorems. Models of this kind work with proof-assistant software such as Lean 4 to understand and generate rigorous proof steps. Put simply, they are AI tools that help computers verify the correctness of mathematical theorems, which requires strong logical reasoning. Their main applications include automated theorem proving (for problems from high-school to university level), finding errors in proofs and suggesting fixes, supporting teaching by generating Lean 4 code and explanations, and helping mathematicians explore new theorems.

DeepSeek has released models of this type before. In August 2024 it released DeepSeek-Prover-V1.5, a model with roughly 7B parameters. According to the information DeepSeek published at the time, V1.5 combined techniques such as reinforcement learning and Monte Carlo tree search, achieved good results on standard theorem-proving benchmarks such as miniF2F and ProofNet, and could handle problems ranging from high-school level to part of the undergraduate curriculum.

Figure: Prover-V1.5 benchmark results (Source: DeepSeek)

DeepSeek-Prover-V2-671B is a huge leap in scale. At 671B parameters it is nearly a hundred times larger than V1.5, and far larger than comparable models such as Llemma-7B/34B and InternLM2-StepProver.

The published configuration file reveals more about the model's structure. The model is built on the DeepSeek-V3 architecture, so many of its settings resemble those of the general-purpose DeepSeek-V3 model. It uses a Mixture-of-Experts (MoE) design: each MoE layer contains 256 routed experts and 1 shared expert, each expert has an intermediate size (moe_intermediate_size) of 2048, and 8 of the routed experts are activated for each input token. The model also supports a maximum context length of 163,840 tokens.

Figure: Configuration file (Source: Hugging Face)

As of publication, however, DeepSeek has not officially released further technical details or performance data. Key information about DeepSeek-Prover-V2-671B, including how it was trained, which theorem-proving-specific data was used, and how it actually performs on benchmarks, remains unknown. Given the enormous increase in parameter count, it is reasonable to hope that Prover-V2 will do better on mathematical proof benchmarks, but that awaits confirmation from official results.

References:
1. https://huggingface.co/deepseek-ai/DeepSeek-Prover-V2-671B/tree/main
2. https://arxiv.org/abs/2408.08152

Layout: Liu Yakun
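
Appendix (illustrative sketch). To make the configuration figures quoted in the article concrete, the short Python sketch below restates them and derives two numbers from them: the share of routed experts that process any one token, and the size ratio between the 671B model and the roughly 7B V1.5. Only moe_intermediate_size is a key name taken from the article; the other dictionary keys, and the helper functions, are hypothetical names chosen for illustration. The top_k_experts function shows generic top-k selection only and is not DeepSeek's actual routing logic, which the article does not describe.

```python
# Figures quoted in the article. Key names other than moe_intermediate_size
# are illustrative assumptions, not copied from the published file.
config = {
    "n_routed_experts": 256,
    "n_shared_experts": 1,
    "num_experts_per_tok": 8,
    "moe_intermediate_size": 2048,
    "max_position_embeddings": 163_840,
}


def routed_active_fraction(cfg):
    """Share of routed experts that process any single token."""
    return cfg["num_experts_per_tok"] / cfg["n_routed_experts"]


def size_ratio(new_params_b, old_params_b):
    """How many times larger one model is than another (sizes in billions)."""
    return new_params_b / old_params_b


def top_k_experts(scores, k):
    """Generic top-k selection: indices of the k highest router scores."""
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:k])


if __name__ == "__main__":
    print(routed_active_fraction(config))        # 8 / 256
    print(round(size_ratio(671, 7), 1))          # 671B vs. roughly 7B
    print(top_k_experts([0.1, 0.7, 0.3, 0.9], k=2))
```

Under these figures only 8 of 256 routed experts (about 3%) run for a given token, which is why a MoE model's per-token compute is far smaller than its total parameter count suggests.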