NVIDIA’s new Eureka AI agent uses GPT-4 by instructing the large language model (LLM) to build its reward-based reinforcement learning software program. It does not require complex instructions or even pre-written patterns; instead, it just begins perfecting a program and responds to external human feedback.
Researchers are using experimentation reinforcement learning to train robots to execute a growing number of jobs, which is difficult and time-consuming. Humans are employing huge language model AI to assist in the instruction process. In a recent experiment, this resulted in some dexterous, but virtual, robots.
NVIDIA Research used an AI system powered by OpenAI’s GPT-4 to teach a game of a robotic hand approximately 30 complicated tasks, such as tossing a ball, moving blocks, and some pen-twirling talents.
Linxi “Jim” Fan, a top researcher at NVIDIA, described Eureka as a unique combination of LLMs and GPU-accelerated modeling programming in the company’s announcement. He said they believe that Eureka will allow for dexterous robot control and offer a new way to create physically realistic animations for artists.
Eureka evaluates its obtained data and instructs the LLM to further develop its design after evaluating its training routine within a complex simulation program. As a result, a nearly self-iterative AI protocol effectively encoding a range of robotic hand designs to wield scissors, spin pens, and open cupboards within a physics-accurate simulated world has been developed.
Eureka’s replacements to human-made trial-and-error learning systems are not only successful; in most circumstances, they outperform those produced by humans. In the conclusions of the team’s open-source research article, Eureka-designed reward systems defeated humans’ code in more than 80% of the tasks, resulting in a boost of more than 50% in the robotic simulations.
Reference
https://www.popsci.com/technology/nvidia-eureka-ai-training/
Similar Posts
- Chinese Company DeepSeek Releases DeepSeek-Coder a LLM for Code Generation
- Alibaba’s Mobile-Agent: A Smart Mobile Assistant
- Grounded SAM: A Unified Model for Diverse Visual Tasks
- Gaussian Head Avatar: High Quality Head Avatar Generator
- Google DeepMind’s AlphaGeometry: Without Assistance Solving Olympiad Geometry Problems
- OMG-Seg: A Unified Segmentation Model