【少儿不宜】没人聊聊星际之门和deepseek?
登录 | 论坛导航 -> 华新鲜事 -> 心情闲聊 | 本帖共有 89 楼,当前显示第 44 楼 : 从楼主开始阅读 : 本帖树形列表 : 返回上一页
作者:萧武达 (等级:5 - 略有小成,发帖:2012) 发表:2025-02-01 19:34:16  44楼 
你不看美国各大厂公告?。。。。
meta
"Managers and engineers from Meta’s generative AI group and infrastructure team have started four war rooms to learn how DeepSeek works. Two of the mobilized
groups are trying to understand how High-Flyer lowered the cost of training and running DeepSeek. Meta wants to apply those techniques, a number of which a
technical paper from High-Flyer outlined, to Llama, one of the employees said. ...

A third Meta research group is trying to figure out what data High-Flyer might have used to train its models, according to one of the employees with direct
knowledge.

The fourth war room is considering new techniques for restructuring Meta’s models based on attributes of the DeepSeek models, they said. Meta is considering
launching a version of Llama that, like DeepSeek, would include numerous AI models, each trained to handle different tasks. That way, when a customer asks Llama
to handle a certain task, only some parts of the model would need to work on it. That could make the overall model faster and require less computing power to
operate."
[本文发送自华新手机Wap版]
欢迎来到华新中文网,踊跃发帖是支持我们的最好方法!原文 / 传统版 / WAP版只看此人从这里展开收起列表

本帖共有 89 楼,当前显示第 44 楼,本文还有 N-1 层楼,要不你试试看:点击此处阅读更多 >>



请登录后回复:帐号   密码