How an 8B Model Beat an Industry Giant
This video describes how a system called ‘AgentStore’ was able to gain the top spot on a benchmark for AI agents – beating out a gigantic model with a small […]
Jasmeet Bhatia: Rising Star in Data & Analytics Jim Griffin
Michael Koved: The Economics of Generative AI Jim Griffin
Nachiket Mehta: Inside the Data Mesh at Wayfair Jim Griffin
Zach Elewitz: The Leak Stops Here Jim Griffin
Pavel Iakubovskii: Kaggle Master at Hugging Face Jim Griffin
Max Mozgovoy: the End of Traditional UX Research? Jim Griffin
Marwa Kechaou: A Keen Eye for Computer Vision Jim Griffin
Iqbal Hossain: The UofAZ Knowledge Map Story Jim Griffin
This is a special edition of the ‘AI World’ video series covering the release of OpenAI-o1 (alias Q* and Strawberry). By whatever name, this is a very powerful new kind of model that has demonstrated remarkable reasoning abilities.
The video starts with a look back in time at “Move 37” – an iconic moment in AI history during the 2016 match between AlphaGo and Lee Sedol. That was a moment when the world saw AI do something that looked a lot like reasoning or strategy, and the latent promise implied by that moment seems to coming to life at this very moment.
For its storyline, the video draws on two very recent papers (and very important) papers:
First, to illustrate the new model’s capabilities, the video showcases that model’s success at decoding an encrypted message, which is definitely not something that a basic language would be able to do.
And with that as context, the focus then turns to the Sequoia Capital investment hypothesis, which is that considerable value will be be unlocked by companies that apply agentic AI in a domain-specific context, especially if those use cases target specialized pools of work. To illustrate this, the video presents XBOW, which is a company that’s been able to use agentic AI to replace highly-skilled experts that do cyber-security penetration testing.
Building on the implications of that example, the video concludes with reflections on the enormous potential impact of these new capabilities – opportunities and risks that can be measured in the trillions of dollars.
This video describes how a system called ‘AgentStore’ was able to gain the top spot on a benchmark for AI agents – beating out a gigantic model with a small […]
Copyright AI Master Group 2023-24