刚刚腾讯混元又开源了4款适合设备端的小模型,从0.5B到7B不等,单卡即可部署!

具备智能体能力,可以执行规划、工具调用以及复杂决策任务

原生支持256K长上下文 可以选择快慢思考模式 支持 SGLang、vLLM 和 TensorRT-LLM等主流框架 可用于手机、平板、智能家居、智能汽车等

GitHub: Hunyuan-0.5B:https://github.com/Tencent-Hunyuan/Hunyuan-0.5B Hunyuan-1.8B:https://github.com/Tencent-Hunyuan/Hunyuan-1.8B Hunyuan-4B:https://github.com/Tencent-Hunyuan/Hunyuan-4B Hunyuan-7B:https://github.com/Tencent-Hunyuan/Hunyuan-7B

HuggingFace: Hunyuan-0.5B:https://huggingface.co/tencent/Hunyuan-0.5B-Instruct Hunyuan-1.8B:https://huggingface.co/tencent/Hunyuan-1.8B-Instruct Hunyuan-4B:https://huggingface.co/tencent/Hunyuan-4B-Instruct Hunyuan-7B:https://huggingface.co/tencent/Hunyuan-7B-Instruct

image.png

image.png