Wednesday, January 29, 2025

Efficient AI? DeepSeek vs NVIDIA, OpenAI

Andrej Karpathy on X: "DeepSeek (Chinese AI co) making it look easy today with an open weights release of a frontier-grade LLM trained on a joke of a budget (2048 GPUs for 2 months, $6M). For reference, this level of capability is supposed to require clusters of closer to 16K GPUs, the ones being" / X

"DeepSeek (Chinese AI co) making it look easy today with an open weights release of a frontier-grade LLM trained on a joke of a budget (2048 GPUs for 2 months, $6M).
...
Does this mean you don't need large GPU clusters for frontier LLMs? No but you have to ensure that you're not wasteful with what you have, and this looks like a nice demonstration that there's still a lot to get through with both data and algorithms."


Andrej Karpathy - Wikipedia

Andrej Karpathy ... computer scientist who served as the director of artificial intelligence and Autopilot Vision at Tesla. He co-founded and formerly worked at OpenAI, where he specialized in deep learning and computer vision.











Python Concurrency: Threads, Processes, asyncio

good explanation and code examples

 Python Concurrency: Threads, Processes, and asyncio Explained

  • threads are like having many workers share one computer
  • processes are like having many workers, each with their own computer
  • asyncio is like having one well organized worker who knows when to switch between different tasks

David Beazley - Python Concurrency From the Ground Up: LIVE! - PyCon 2015 - YouTube