play_arrow

keyboard_arrow_right

skip_previous play_arrow skip_next
00:00 00:00
playlist_play chevron_left
volume_up

Blog

9 Results / Page 1 of 1


Shark Alert! YOLO AI-Vision in Action

Jim Griffin August 17, 2024

Last week, several news outlets ran a story about SharkEye, which is an AI-vision shark detection program, developed at the University of California, Santa Barbara, and deployed at California’s Padaro Beach, which is an area where surfers and great white sharks are both frequently found. After quickly describing the program […]

Call a Doctor! –Blue Screen Lessons Learned

Jim Griffin July 24, 2024

Companies worldwide grappled on Friday with what Troy Hunt, famously described as “the largest IT outage in history,” caused by a faulty sensor configuration update that got pushed to Microsoft by the cyber-security giant, CrowdStrike, resulting in a $31 billion loss in market capitalization for the company. Specific information about […]

New Method Runs Big LLMs on Smartphones

Jim Griffin June 26, 2024

There’s a big breakthrough that just came out for handling large language models on smartphones. It’s called PowerInfer-2 and what it does is look at every option for a processing an LLM on a particular smartphone, and picks the fastest way for that particular LLM on that particular device. For […]

Nemotron-4 is BIG in More Ways than One

Jim Griffin June 20, 2024

Last week, NVIDIA announced Nemotron-4, which consists of three models: Base, Instruct and Reward. These three models work together within the NeMo framework to enable the creation and fine-tuning of new large language models. At 340 billion parameters, this new entrant far bigger than any other open source model, but […]

Testing Ollama on Hard Questions

Jim Griffin June 13, 2024

Ollama is a popular platform for running language models on your local machine, with access to almost 100 different open source models, including llama-3 from Meta, Phi3 from Microsoft, Aya 23 from Cohere, the Gemma models from DeepMind and Mistral. This video shows llama-3 being run on a laptop, using […]

Summarize THIS!

Jim Griffin June 3, 2024

This is a demo of Any Summary, which is a tool that uses OpenAI on the back end to summarize 12 different file types, up to 100 MB each, including PDFs, audio files, videos, and web pages. The context for the demo was an email I received on a Friday, […]

Mr. Bongo Makes a GPT

Jim Griffin June 3, 2024

This week, we joined up with Mr. Bongo to create a custom GPT, using Retrieval Augmented Generation (RAG), so it uses only our own internal documents, and is available only to our own internal users. This was done in a step-by-step manner, like a ‘How To’ video. For this demo, […]

Segment of One – Now it’s Real

Jim Griffin June 3, 2024

“Segment of One” is where every customer in a database of millions can be treated in a different way. Although there’s been buzz about that since at least 1989, true Segment of One is still rare. This video looks under the hood at the model and approach for a true […]

What is Pinecone?

Jim Griffin June 3, 2024

As you get ready to implement a generative AI project, you’ll probably start hearing about Pinecone. That’s because you’ll need a vector database, or some other kind of vector alternative as the memory layer, and since Pinecone was an early mover that helped to create the vector database category, you’ll […]