DraganSr: 2025-01-29

Wednesday, January 29, 2025

Efficient AI? DeepSeek vs NVIDIA, OpenAI

Andrej Karpathy on X: "DeepSeek (Chinese AI co) making it look easy today with an open weights release of a frontier-grade LLM trained on a joke of a budget (2048 GPUs for 2 months, $6M). For reference, this level of capability is supposed to require clusters of closer to 16K GPUs, the ones being" / X

"DeepSeek (Chinese AI co) making it look easy today with an open weights release of a frontier-grade LLM trained on a joke of a budget (2048 GPUs for 2 months, $6M).
...
Does this mean you don't need large GPU clusters for frontier LLMs? No but you have to ensure that you're not wasteful with what you have, and this looks like a nice demonstration that there's still a lot to get through with both data and algorithms."
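
A quick back-of-the-envelope check of the numbers in the quote, as a rough sketch only. The 60-day length for "2 months" is my assumption, and the variable names are made up for illustration; the GPU count and the $6M budget come from the quote.

```python
# Figures from the quote: 2048 GPUs, 2 months, $6M.
gpus = 2048
days = 60  # assumption: "2 months" taken as 60 days
budget_usd = 6_000_000

# Total GPU-hours if every GPU runs around the clock for the whole period.
gpu_hours = gpus * days * 24

# Implied price of one GPU-hour under that assumption.
cost_per_gpu_hour = budget_usd / gpu_hours

print(f"{gpu_hours:,} GPU-hours")                # 2,949,120 GPU-hours
print(f"${cost_per_gpu_hour:.2f} per GPU-hour")  # $2.03 per GPU-hour
```

So the quoted budget works out to roughly 3 million GPU-hours at about $2 per GPU-hour, if the 60-day assumption holds.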

Andrej Karpathy - Wikipedia

Andrej Karpathy ... computer scientist who served as the director of artificial intelligence and Autopilot Vision at Tesla. He co-founded and formerly worked at OpenAI, where he specialized in deep learning and computer vision.

Nvidia stock begins recovery after DeepSeek AI frenzy prompted near $600 billion loss

DeepSeek: all the news about the startup that’s shaking up AI stocks | The Verge

DeepSeek R1 Shocked The World - Reactions Explained - YouTube

Viral AI company DeepSeek releases new image model family | TechCrunch

DeepSeek stole our tech... says OpenAI - YouTube

Python Concurrency: Threads, Processes, asyncio

A good explanation, with code examples.

Python Concurrency: Threads, Processes, and asyncio Explained

Threads are like having many workers share one computer.
Processes are like having many workers, each with their own computer.
asyncio is like having one well-organized worker who knows when to switch between different tasks.
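
The three analogies above can be sketched with the standard library. This is my own minimal illustration, not code from the linked article: the function names (`square`, `run_in_threads`, `run_in_processes`, `run_with_asyncio`) are made up, and the short sleep only stands in for waiting on I/O.

```python
import asyncio
import time
from concurrent.futures import ProcessPoolExecutor, ThreadPoolExecutor


def square(n):
    """Hypothetical unit of work; the sleep mimics waiting on I/O."""
    time.sleep(0.01)
    return n * n


def run_in_threads(numbers):
    # Many workers sharing one computer: threads run inside one process
    # and share its memory.
    with ThreadPoolExecutor(max_workers=4) as pool:
        return list(pool.map(square, numbers))


def run_in_processes(numbers):
    # Each worker with its own computer: separate processes with separate
    # memory, so arguments and results are pickled between them.
    with ProcessPoolExecutor(max_workers=4) as pool:
        return list(pool.map(square, numbers))


async def square_async(n):
    # While this task awaits, the event loop switches to another task.
    await asyncio.sleep(0.01)
    return n * n


async def run_with_asyncio(numbers):
    # One well-organized worker: a single thread interleaving many tasks.
    return list(await asyncio.gather(*(square_async(n) for n in numbers)))


if __name__ == "__main__":
    data = [1, 2, 3, 4]
    print(run_in_threads(data))
    print(run_in_processes(data))
    print(asyncio.run(run_with_asyncio(data)))
```

All three calls return the squares in input order. The `if __name__ == "__main__":` guard matters for the process version, because on platforms that start workers by spawning a fresh interpreter the main module is imported again in each child.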

David Beazley - Python Concurrency From the Ground Up: LIVE! - PyCon 2015 - YouTube