| | | | | | OpenAI o1 is simply brilliant, but the question remains: can it truly reason and think like humans? Your friendly AI Human, Amit Raja Naik, believes that most humans—especially those not reading this newsletter rn—barely think at all, living in their own bubble. | | | | | | | | | | | | | | Sam Altman, the CEO of OpenAI, calls the launch “the beginning of a new paradigm: AI that can do general-purpose complex reasoning.” The new model quite literally takes some time to think before responding, as opposed to earlier OpenAI models that start generating text as soon as you give it a prompt. The model does this by producing a long internal chain of thought prompting before responding to the user. That explains why, the team has also in some way suggested to not ask generic questions to the model as its high reasoning capabilities would work better for complex PhD-level problems, giving answers with PhD-level accuracy. But apart from coding and maths, this reasoning capability is the special highlight of the release. These ‘reasoning’ and ‘thinking’ capabilities were long being touted as the next frontier by Altman in all his speeches and it seems to finally be landing on the right spot. According to the Learning to Reason with LLMs blog, the reinforcement learning algorithm developed by OpenAI helps the model think more efficiently by refining its thought process through a data-efficient training method. Over time, the performance of “o1” improves as more training and thinking time is added. This differs from traditional LLM pretraining, which focuses more on expanding the size of the model, instead of focusing on increasing reasoning with a small model. Through reinforcement learning, o1 improves its reasoning skills by breaking down complex problems, correcting mistakes, and trying new approaches when needed. This greatly enhances its ability to handle complicated prompts that require more than just predicting the next word—it can backtrack and “think” through the task. However, a key challenge is that the model’s reasoning process remains hidden from users, even though they are billed for it, which are called “reasoning tokens.” OpenAI has explained that hiding the reasoning steps is necessary for two main reasons. First, for safety and policy compliance, as the model needs freedom to process without exposing sensitive intermediary steps. Second, to maintain a competitive advantage by preventing other models from using their reasoning work. This hidden process allows OpenAI to monitor the model’s thought patterns without interfering with its internal reasoning. | | | | | | | | | As Jim Fan explained, this “Project Strawberry” or o1 model is marking a significant shift towards inference-time scaling in production, a concept that focuses on improving reasoning through search rather than just learning. Reasoning doesn’t require large models. Many parameters in current models are dedicated to memorising facts for trivia-like benchmarks. Instead, reasoning can be handled by a smaller “reasoning core” that interacts with external tools, like browsers or code verifiers. This approach reduces the need for massive pre-training compute. A significant portion of compute is now dedicated to inference, rather than pre- or post-training. LLMs simulate various strategies, similar to how AlphaGo uses Monte Carlo Tree Search (MCTS). Over time, this leads to better solutions as the model converges on the best strategy. This was also explained by Subbarao Kambhampati in his post. OpenAI likely discovered the benefits of inference scaling early on, while academic research has only recently caught up. While effective in benchmarks, deploying o1 for real-world reasoning tasks presents challenges. Determining when to stop searching, defining reward functions, and managing compute costs for processes like code interpretation are complex issues that need to be solved for broader deployment. o1 can act as a data flywheel, where correct answers generate training data, complete with both positive and negative rewards. This process improves the reasoning core over time, similar to how AlphaGo’s value network refined itself through MCTS-generated data. This would in the end create more valuable data. | | | | | | | Enjoy the full story here. | | | | | How Generative AI Tools are Helping Police Solve Missing Children Cases in India | | | | | | | Recently, Dainik Bhaskar and the Rajasthan Police collaborated to use AI to solve cases of children gone missing. Joining this effort is Sahid NK, a young graphic designer who brings a creative touch to the project. In an exclusive interview with AIM, Sahid shared, “I get old, worn-out photos where the faces are so faded that it’s hard to even recognise them. We have to be imaginative and use the little details we have to recreate a face from the past.” Sahid mentioned that he relies on AI-powered tools to bring these faces back to life. “I use models trained to understand intricate facial features. It’s a blend of creativity, skill, and technology,” he said. Among his go-to tools are Freepic’s Picasso and the sophisticated Illusion Diffusion, a generative AI tool. | | | | | | | AI godmother Fei-Fei Li, alongside Justin Johnson, Christoph Lassner, and Ben Mildenhallhas has co-founded World Labs, securing $230 million to develop AI capable of understanding and interacting with the 3D physical world. The Telangana government has signed 26 MoUs with global tech giants like OpenAI, Meta, and NVIDIA to bolster AI infrastructure, establish an AI City near Hyderabad, and upskill 2.5 lakh students. Google DeepMind has unveiled ALOHA Unleashed and DemoStart, groundbreaking AI systems that dramatically enhance robot dexterity, enabling tasks like tying shoelaces and inserting gears with human-like precision, while using advanced reinforcement learning to cut training time by 100x. Y Combinator, led by Garry Tan, is expanding to four batches a year starting in 2025, marking its biggest shift since 2005 to accommodate the growing influx of AI startups. Adobe Express now supports eight Indian languages, including Hindi, Tamil, and Bengali, making its design tools and generative AI features more accessible to millions of users in India, as Adobe caters to the country’s rapidly expanding content creation market. | | | | | Leveraging Cloud for Seamless Data Integration and Breaking Down Data Silos | | | | | | | The DECODE webinar is back with its second episode, where leaders from DBS Bank and Google will explore how cloud technology can break down data silos, enabling seamless data integration, faster decision-making, and smarter business operations. Featuring insights from Luis Carlos Cruz Huertas (DBS Bank) and Sriram Venkat (Google), this session promises to deliver practical approaches for overcoming technical barriers in data integration. | | | | | | | | | Join the NVIDIA AI Summit India from October 23–25, 2024, at the Jio World Convention Centre in Mumbai to explore AI innovations across generative AI, robotics, supercomputing, and more, with 70% of use cases addressing India's grand challenges. Don't miss the Fireside Chat with NVIDIA CEO Jensen Huang on October 24. | | | | | AIM & NVIDIA Present DevPalooza 4.0: The Ultimate Developer Meetup in Bengaluru | | | | | | | Join us at DevPalooza 4.0 in Bengaluru, powered by AIM and NVIDIA, to dive into hands-on generative AI workshops, explore applications, and network with AI professionals—click here to register and secure your spot! | | | | | | | | | Cypher 2024 marks a significant expansion as it celebrates its 8th edition by branching out to the USA in addition to its already established presence in India. Browse through the links below to learn more about the different editions of Cypher 2024. These links will guide you to comprehensive event information, including agendas, speakers, registration details, and more. | | | | | Enjoying Sector 6 (formerly AIM Daily XO)? Share it with colleagues or friends – they can sign up here. We love hearing from our readers! Have thoughts on our new format? Questions, comments, or ideas are always welcome. If there’s a specific topic in AI or analytics that you're curious about, tell us! Reach out to us at info@analyticsindiamag.com. Stay tuned for more insights in our next edition!
Curated with ♥️ in Namma Bengaluru | | | | | | | | |
Комментариев нет:
Отправить комментарий
Примечание. Отправлять комментарии могут только участники этого блога.