On the Deployment of Multimodal Large Models in the Robotics Industry
Published: 2024-10-31  Source: http:///
Recently, several domestic companies have achieved technical breakthroughs in combining large models with robots.
Industry observers believe that as the technology advances and application scenarios expand, demand for multimodal large models and robots will keep growing, opening up a broad market for companies. In addition, cooperation with other industries, such as healthcare and manufacturing, will bring new opportunities for the development of multimodal large models and robots, enabling a wider range of applications and greater commercial value.
Multimodal Robots Achieve Technical Breakthroughs
As of the close on December 13, several robotics-related stocks, including Kinco, Efort, and Leaderdrive, had risen more than 4%. On the news front, Tesla released a video of its Optimus-Gen 2 humanoid robot, which is equipped with Tesla-designed actuators and sensors, walks 30% faster, and shows improved balance and whole-body control.
"Multimodal" AI refers to large models that can process multiple forms of content, such as text, audio, images, video, and code. As multimodal large models iterate rapidly, major international players have been paying close attention to their application in robotics and have explored core tasks such as robot planning, control, and navigation.
He Li, general manager of Zhiyushan Investment, told Securities Daily: "Multimodal large models fuse vision, speech, and sensor data processing, greatly enriching robots' cognition and decision-making. Applying this technology to robots promises major progress in areas such as complex interaction, natural-language understanding, and environmental adaptation, unlocking their potential as highly autonomous assistants or workers."
Domestic companies have already moved early in this field. On the evening of December 12, Orbbec released its Large-Model Robotic Arm 1.0, which takes voice prompts as input and uses the understanding and visual perception capabilities of multiple large models to generate spatial semantic information, allowing the arm to interpret and execute actions. In the video released alongside the product, the arm successfully carried out a series of voice commands, including "put the green square in the yellow box" and "please return to the starting state".
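The pipeline described above can be sketched in miniature. The following is a toy illustration, not Orbbec's actual implementation: a stand-in parser plays the role of the language model, a scene dictionary plays the role of the vision model's object detections, and the "arm" simply records pick-and-place moves. All names and the command grammar are assumptions for illustration.

```python
import re
from dataclasses import dataclass

@dataclass
class Action:
    verb: str
    obj: str
    target: str

def parse_prompt(prompt: str) -> Action:
    """Stand-in for the language model: extract (verb, object, target)
    from a command like 'put the green square in the yellow box'."""
    m = re.match(r"put the (\w+ \w+) in the (\w+ \w+)", prompt.lower())
    if not m:
        raise ValueError(f"unrecognized prompt: {prompt!r}")
    return Action("place", m.group(1), m.group(2))

def ground_objects(scene: dict, action: Action) -> tuple:
    """Stand-in for visual perception: map object names to the
    coordinates a detector would report (spatial semantics)."""
    return scene[action.obj], scene[action.target]

def execute(arm_moves: list, src, dst) -> None:
    """Stand-in for the arm controller: pick at src, place at dst."""
    arm_moves.append(("pick", src))
    arm_moves.append(("place", dst))

# Example scene: object name -> (x, y) position from a detector.
scene = {"green square": (0.12, 0.30), "yellow box": (0.45, 0.10)}
moves = []
action = parse_prompt("Put the green square in the yellow box")
src, dst = ground_objects(scene, action)
execute(moves, src, dst)
```

The real system replaces the regex with a large model and the scene dictionary with live camera perception, but the division of labor — language understanding, visual grounding, motion execution — is the same.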
Xiao Zhenzhong, co-founder and CTO of Orbbec, told Securities Daily: "The company hopes that through engineering research the large-model robotic arm can be deployed in real scenarios, which includes improving the arm's ability to automatically navigate around complex obstacles while carrying out human commands and solving the generalization problem of combining large models with robotic arms, ultimately achieving deployment in general-purpose scenarios."
According to incomplete statistics, listed companies including Thundersoft and Yijiahe have also recently disclosed progress on robot R&D based on multimodal large models.
Large-Scale Commercial Application Still Needs Time
China's robotics industry already has a solid industrial foundation. Multimodal robots with smart brains and agile limbs are becoming a new arena in which many players compete for industries of the future.
He Li believes that in the domestic market, companies have invested actively in the R&D and production of key technology links, especially sensors, precision mechanical components, actuators, and innovative materials and lightweight structural parts, showing strong momentum.
The harmonic reducer is a core component of industrial robots. Leaderdrive disclosed that it completed R&D on harmonic reducer technology for industrial robots early on and achieved large-scale production, becoming the first in the field to substitute for imported products and greatly reducing domestic robot makers' procurement costs and lead times. Its new-generation Y-series harmonic reducer, through innovations in mathematical modeling and optimizations in bearing design and machining processes, doubles the stiffness of its existing counterparts.
Xiao Zhenzhong agrees. He told Securities Daily: "Large language models (LLMs) combined with visual sensing will bring all kinds of robots and robotic arms into more scenarios, such as industrial manufacturing, flexible logistics, and commercial services. At present there is still a gap between large models and real-world data, and running large models consumes considerable computing power. Applications will take three to five years to gradually materialize, and business maturity may take even longer."
"But the company firmly believes this is the right direction, and the prospects are broad," Xiao Zhenzhong said. Orbbec is building a robotics and AI vision middle platform: by developing multimodal vision models and intelligent algorithms and combining them with robot vision sensors, it has formed a complete product solution for autonomous positioning, navigation, and obstacle avoidance, actively preparing for the era of intelligent robots.
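At the lowest level, the obstacle-avoidance part of such a stack reduces to checking depth-sensor readings against a safety distance before passing a velocity command to the drive. The sketch below is a hypothetical simplification, not Orbbec's product code; the function name and the 0.5 m safety threshold are assumptions for illustration.

```python
def safe_velocity(depth_readings_m, commanded_mps, stop_dist_m=0.5):
    """Gate a commanded forward velocity on depth-camera readings:
    if any reading in the forward field of view is closer than the
    safety distance, command a full stop; otherwise pass through."""
    if min(depth_readings_m) < stop_dist_m:
        return 0.0
    return commanded_mps
```

A production navigation stack would instead steer around the obstacle using an occupancy map and a local planner, but the same gating principle serves as the final safety layer.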