- Digestible AI
- Posts
- Agents, Agents Everywhere
Agents, Agents Everywhere
+ Your Guide to YouTube Speed-Reading with Gemini 📺
data:image/s3,"s3://crabby-images/43115/43115ef2317ee52f773df1b944915335c85b5ec4" alt=""
AI Agents are coming thick and fast with OpenAI soon to be joining the race…
In this edition we’ll be covering…
OpenAI’s new “Operator” Agent, rumored to be coming early next year
A tutorial on how to use Gemini to summarize and analyze YouTube videos
A guide on how to use the new Grok API for FREE
6 trending AI topics
3 new AI tools you can use today
And much more…
Let’s get into it!
“Operator” Standing By (It’s Just your AI Agent)
OpenAI is gearing up to release a new AI agent codenamed “Operator” in January. The new feature will allow an AI agent to use a computer and take actions on a person’s behalf… fully autonomously.
The details:
OpenAI is finally joining the AI agent party with Operator. Think of it as your computer's new co-pilot, but this one doesn't need coffee breaks.
Everyone's getting in on the action – Anthropic, Microsoft, and Google are also racing to build their own agent tools.
So What?
The AI industry is rapidly shifting toward autonomous agents capable of handling complex, multi-step tasks with minimal oversight.
While OpenAI might be late to this party, their upcoming "Operator" release, which I’d assume would be powered by its sophisticated o1 model, suggests they're not just trying to play catch up – they're aiming to raise the bar for what AI agents can accomplish.
Together with: FetchFox
Scrape any data from any website, with AI
FetchFox is the easiest web AI scraper. Visit any website, type your scrape prompt and click "Run". The powerful FetchFox backend can scrape a 1 page or 1,000 pages without skipping a beat. You can scrape popular sites like Reddit, X, LinkedIn, Google results and more.
Tool Spotlight
How to Analyze YouTube Videos with Gemini
data:image/s3,"s3://crabby-images/d0a6f/d0a6fb0a0297da397c9034abcaa09f527802a91f" alt=""
Ever wished you had a smart friend who could watch YouTube videos and give you the good stuff without the "don't forget to smash that like button" fluff?
Google Gemini's YouTube extension is here to be that friend – and it doesn't even need snacks to watch with you.
Why This is Cool:
Gemini can watch YouTube videos and break them down faster than you can say "2x speed." Whether you're researching, studying, or just want the TL;DW (Too Long; Didn't Watch), this is your new secret weapon. 👇️
Navigate to Google Gemini
Jump down to the Settings menu, hit “Extensions”
Toggle the YouTube extension “On” (shown in gif above)
Open a new chat and use the following prompt:
@YouTube - [LINK TO YOUTUBE VIDEO]
Here's a [TOPIC] video. Could you please:
1. Summarize the main points
2. List any key timestamps
3. Extract actionable takeaways
4. Note any important references or resources mentioned
🔥 Pro Tips:
Ask for specific formats (bullet points, timeline, key quotes)
Use it for lecture videos, tutorials, or long-winded tech reviews
Innovation Showcase
Grok & Roll: Teaching AI to Write Like You
The xAI API is incredible.
I just created an AI assistant that can fetch news content from URLs and write a post about it on my own writing style.
Super simple to set up and you can try free. Here’s how:
— Alvaro Cintas (@dr_cintas)
6:26 PM • Nov 8, 2024
Last week, xAI announced developers can build on Grok foundation models using the newly released API.
Since xAI is offering $25 in monthly API credits to everyone until 2024 ends, let's dive into how you can put these free credits to work:
Sign up at on the xAI console and generate an API key (you should get a $25 credit).
data:image/s3,"s3://crabby-images/759ad/759ad65c9823dca0e7517a27c6b9cde13cf4e336" alt=""
There is no native SDK yet from xAI, but it has direct support for OpenAI’s:
import os
from openai import OpenAI
XAI_API_KEY = YOUR_GENERATED_API_KEY
# Best practice would be to put it into your .env file, XAI_API_KEY = os.getenv("XAI_API_KEY")
client = OpenAI(
api_key=XAI_API_KEY,
base_url="https://api.x.ai/v1",
)
completion = client.chat.completions.create(
model="grok-beta",
messages=[
{"role": "system", "content": "You are Grok, a chatbot inspired by the Hitchhikers Guide to the Galaxy."},
{"role": "user", "content": "What is the meaning of life, the universe, and everything?"},
],
)
print(completion.choices[0].message.content)
Response:
Ah, the ultimate question! According to the brilliant minds behind the Hitchhiker's Guide to the Galaxy, the answer is **42**. However, it's worth noting that the real trick lies in figuring out what the actual question is. Life, the universe, and everything are quite the enigma, wrapped in a riddle, served with a side of perplexing cosmic soup. So, while 42 might be the answer, the journey to understanding the question is where the real adventure lies!
For a more comprehensive example, follow Alvarto Cintas’ tutorial on making Grok sound just like you!
Quick Bites
Stay updated with our favorite highlights, dive in for a full flavor of the coverage!
Google DeepMind open-sources AlphaFold 3 for medicine and molecular biology.
Lex Fridman recently sat down with Anthropic CEO, Dario Amodei, to discuss the future of Claude, humanity, and AGI.
Qwen2.5-Coder is the latest series of Code-Specific Qwen large language models and brings significant improvements in code generation, code reasoning and code fixing.
Amazon is poised to roll out its newest artificial intelligence chips, Tranium 2, as the Big Tech group seeks returns on its multibillion-dollar semiconductor investments and to reduce its reliance on market leader NVIDIA.
Music generation tool Suno is launching its “v4” soon.
Microsoft introduces new adapted AI models for industry.
Trending Tools
📸 Snap Code - An AI-powered platform that allows you to convert images into code in just seconds.
🦦 Otter.ai - Never take meeting notes again. Get transcripts, automated summaries, action items, and chat with Otter to get answers from your meetings.
📜 PaperGen - Helps you generate well-structured long-form papers with fully referenced citations. It ensures originality, clarity, and precision with AI detection bypassing for a more human-like writing experience.
The Neural Network
Speaking of AI's pace (which we discussed in our last newsletter), here's what Sam Altman had to say:
there is no wall
— Sam Altman (@sama)
6:06 AM • Nov 14, 2024
Cryptic as usual, but the message is clear. This means Sam’s got something up his sleeve, and he is taking this momentum all the way to next year (Operators and beyond).
Hold your hats folks, we’re in for a crazy winter, and I’m not talking about the snow…
Until We Type Again…
How did we do?This helps us create better newsletters! |
If you have any suggestions or specific feedback, simply reply to this email or fill out this form. Additionally, if you found this insightful, don't hesitate to engage with us on our socials and forward this over to your friends!
You can find our past newsletter editions here!
This newsletter was brought to you by Digestible AI. Was this forwarded to you? Subscribe here to get your AI fix delivered right to your inbox. 👋