play_arrow

keyboard_arrow_right

skip_previous play_arrow skip_next
00:00 00:00
playlist_play chevron_left
volume_up

Blog

19 Results / Page 2 of 3


Will Open-Source Llama Beat GPT-4o?

Jim Griffin August 1, 2024

Last week Meta launched its newest family of models, Llama 3.1, including a new benchmark – an open-source foundation model with 405 billion parameters. With this, Zuckerberg predicted that Meta AI will surpass OpenAI’s 200 million monthly active users by the end of this year. Hubris aside, this video looks […]

Amazing Milestone! Million Experts Model

Jim Griffin July 17, 2024

A top researcher at Google DeepMind just released an important paper, “Mixture of a Million Experts.” As the paper’s title announces, it describes an approach that resulted in the first-known Transformer model with more than a million experts. For context, the number of experts currently seen in smaller models varies […]

How a Language Model Aced a Top Leaderboard

Jim Griffin July 3, 2024

This video shares details about a remarkable experiment by researchers in Tokyo, who teamed up with Oxford and Cambridge Universities to study whether large language models might now be able to write code that improves their own performance. The answer was Yes. Not only that, the model created a whole […]

New Method Runs Big LLMs on Smartphones

Jim Griffin June 26, 2024

There’s a big breakthrough that just came out for handling large language models on smartphones. It’s called PowerInfer-2 and what it does is look at every option for a processing an LLM on a particular smartphone, and picks the fastest way for that particular LLM on that particular device. For […]

Nemotron-4 is BIG in More Ways than One

Jim Griffin June 20, 2024

Last week, NVIDIA announced Nemotron-4, which consists of three models: Base, Instruct and Reward. These three models work together within the NeMo framework to enable the creation and fine-tuning of new large language models. At 340 billion parameters, this new entrant far bigger than any other open source model, but […]

Happy Birthday SETI@Home!

Jim Griffin June 3, 2024

SETI@home was officially launched on May 17, 1999 which makes it 25 years old this week, so Happy Birthday SETI! As you might recall, SETI stands for Search for Extraterrestrial Intelligence. This video describes the origins and background of SETI, and the amazing scale that it achieved worldwide. It then […]

Segment of One – Now it’s Real

Jim Griffin June 3, 2024

“Segment of One” is where every customer in a database of millions can be treated in a different way. Although there’s been buzz about that since at least 1989, true Segment of One is still rare. This video looks under the hood at the model and approach for a true […]

Has AI Learned to Lie? New Findings!

Jim Griffin June 3, 2024

This video describes an experiment where a Large-Language Model was convinced by a series of prompts that it should not tell the truth, and so it intentionally gave false information. That example is explored against a backdrop of some of the more famous cases where AI has deceived some of […]