zara
02-19
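DS got this question wrong too when it first came out.
Is 9.11 greater than 9.9? Grok3, which Musk claims is the "smartest in the world," has "flopped"
Disclaimer: the content above represents only the poster's personal views and does not constitute investment advice from this platform.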
Reposted article (上观新闻, 2025-02-19 15:10 Beijing time): https://finance.sina.com.cn/jjxw/2025-02-19/doc-inekzcyh5678030.shtml

Recently, Musk and the xAI team officially released the latest version, Grok-3, in a livestream.

Earlier, Musk had described Grok-3 as "the smartest AI on Earth." He wrote on X that he had spent the whole weekend polishing the product with the team.

However, according to media reports, someone tested the latest beta of Grok-3 and asked the classic question used to trip up large models: "Which is larger, 9.11 or 9.9?" Unfortunately, when asked without any qualifiers or annotations, Grok-3, billed as the smartest model to date, still could not answer correctly.

Notably, when DeepSeek was asked the same question, it gave the correct answer, that 9.9 is greater than 9.11, whether or not Deep Thinking (R1) mode was enabled.

"Which is larger, 9.11 or 9.9" is a classic question in the AI field.

A screenshot posted on social media by Lin Yuchen (林禹臣), a member of the Allen Institute, showed ChatGPT-4o answering that 13.11 is larger than 13.8. "On the one hand, AI is getting better and better at math olympiad problems; on the other, common sense is still hard," he said.

Riley Goodside, a prompt engineer at Scale AI, then took that as inspiration, rephrased the question, and put it to what were arguably the strongest models at the time, ChatGPT-4o, Google Gemini Advanced, and Claude 3.5 Sonnet: which is larger, 9.11 or 9.9? All of these mainstream models answered incorrectly, and he succeeded in spreading the topic widely.

[Image: mainstream overseas large models answering the question. Image source: 第一财经]

The backdrop at the Grok-3 launch read "our mission is to understand universe". Musk has said that xAI's goal is to "understand the universe."

A week earlier, commenting on DeepSeek R1 in a livestream, Musk had confidently said that "xAI is about to release a better AI model." According to the data shown at the event, Grok-3 has surpassed all current mainstream models on math, science, and coding benchmarks. Musk even claimed that Grok-3 would be used for SpaceX Mars mission calculations and predicted "a Nobel Prize-level breakthrough within three years."

[Image: xAI livestream on X, with Musk present. Image source: 中国新闻周刊]

Musk stressed that Grok-3 can reduce AI hallucinations by cross-checking data and trying to achieve logical consistency. He also revealed that training Grok-3 used far more compute than previous versions, along with a large amount of synthetic data.

Unlike DeepSeek's algorithm-optimization path (DeepSeek-V3 was trained on 2,048 H800 GPUs for 2,788 thousand GPU hours), xAI said that Grok-3's development benefited from the Colossus supercomputer, which was built in eight months and is powered by 100,000 NVIDIA H100 GPUs. It provided 200 million GPU hours for training, more than ten times the figure for Grok-2.

In addition, xAI announced a Grok-3 intelligent search engine called DeepSearch, a name that bears some resemblance to DeepSeek.

Source: 九派新闻, compiled from 东方财经, 第一财经, 中国新闻周刊 and others
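For anyone who wants to check the question from the article themselves (whether 9.11 or 9.9 is larger), here is a minimal Python sketch. The article does not explain why models get it wrong. The version-number reading below is only an assumption added for illustration, and both function names are hypothetical.

```python
from decimal import Decimal


def larger_as_decimal(a: str, b: str) -> str:
    """Return whichever string is larger when both are read as decimal numbers."""
    return a if Decimal(a) > Decimal(b) else b


def larger_as_version(a: str, b: str) -> str:
    """Return whichever string is larger when each dot-separated part is read
    as a separate integer, the way software version numbers are compared.
    (Illustrative assumption only; not something the article claims.)"""
    parts_a = tuple(int(p) for p in a.split("."))
    parts_b = tuple(int(p) for p in b.split("."))
    return a if parts_a > parts_b else b


if __name__ == "__main__":
    for x, y in [("9.11", "9.9"), ("13.11", "13.8")]:
        print(x, "vs", y,
              "| as decimals:", larger_as_decimal(x, y),
              "| as versions:", larger_as_version(x, y))
```

Read as decimals, 9.9 and 13.8 come out larger, which matches the answer the article calls correct. Read as version-style numbers, 9.11 and 13.11 come out larger, which matches the wrong answers described in the article.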