xAI’s Grok-2 recently secured the #2 spot on the LMSys Chatbot Arena leaderboard, surpassing GPT-4o. No one saw this coming, not even your friendly AI Human, Amit Raja Naik. So, let’s take a deep breath and dive into the details, shall we?

xAI’s Grok-2 has excelled particularly in mathematical tasks, ranking #1 in this category, and secured the #2 position across various other tasks, including hard prompts, coding, and instruction-following. Additionally, Grok-2-Mini has undergone significant speed enhancements and now performs twice as fast as before. This boost came after xAI’s inference team completely rewrote the inference stack using SGLang, enabling more efficient multi-host inference and improved accuracy. The team also introduced new algorithms for computation and communication kernels, alongside better batch scheduling and quantisation, further enhancing the models’ performance.

While Google, OpenAI, Meta and others are still figuring it out, pushing their responsible AI agenda and promoting content partnerships, many believe that xAI’s real-time information access and minimal censorship could be among the reasons why Grok-2 is now officially the most useful LLM. Others can only dream of what xAI just did with its approach of little to no guardrails.

“They had a version of Dolly called Image Gen, and it was prohibited from making human form,” said Buchheit, discussing Google’s Imagen and how the company has struggled to dominate the AI landscape despite having all the necessary resources and an early start in AI.

Several people remain sceptical about the performance. OpenAI’s GPT-4o, which claims the top spot, does not perform as well as Claude 3.5, which sits in 5th place. Still, people have started experimenting with Grok-2 and claim that the model is actually brilliant at coding and maths-related tasks.
Released in beta this month, the Grok-2 family of models is also available for testing on X. The model also allows users to generate images using the FLUX.1 image generation model.

Sara Hooker, VP of research at Cohere AI, recently said, “When you try and make AI actually work for the world, you’re talking about this vast array of different languages. There are 7,000 languages in the world, and 80% of those have no text data.” This lack of diverse language data leads to models that overfit high-resource languages like English and Chinese while under-serving the “longtail” of low-resource languages. So what’s the solution? Read on to find out.

Former OpenAI and Tesla engineer Andrej Karpathy lauded Cursor over GitHub Copilot, highlighting its growing dominance as an AI developer tool, and praised it for enhancing coding speed and adaptability.

Redis released Redis 8 with integrated AI capabilities, expanding developer access to advanced AI features like Redis for AI and cost-saving solutions like Redis Flex, enhancing workflows and scaling AI applications globally.

Salesforce launched Einstein SDR Agent and Einstein Sales Coach Agent, two autonomous AI tools designed to boost sales team productivity by engaging leads and providing personalised coaching.

Netflix partnered with Snowflake to enhance its advertising capabilities using Data Clean Rooms, providing advertisers with secure, privacy-compliant insights and improved campaign performance.

Cypher 2024 marks a significant expansion as it celebrates its 8th edition by branching out to the USA in addition to its already established presence in India. Browse through the links below to learn more about the different editions of Cypher 2024. These links will guide you to comprehensive event information, including agendas, speakers, registration details, and more.

Enjoying Sector 6 (formerly AIM Daily XO)?
Share it with colleagues or friends – they can sign up here. We love hearing from our readers! Have thoughts on our new format? Questions, comments, or ideas are always welcome. If there’s a specific topic in AI or analytics that you're curious about, tell us! Reach out to us at info@analyticsindiamag.com. Stay tuned for more insights in our next edition!
Curated with ♥️ in Namma Bengaluru