DraganSr: AI: LLMs Intro by Andrej Karpathy, LLM "OS"

Sunday, November 26, 2023

llama-2-70b model (open source from Meta) = 140GB params + ~500 lines of .c code (!!!)

PhD from Stanford in AI/ML (CNNs)
Lead of self-driving in Tesla
Co-founder of OpenAI

system 1 vs system 2 "thinking"

LLMs are currently only "system 1" (fast and unreliable)

future: attempt "system 2 thinking", "trade time for accuracy", take more time for better result.

"self improvement": hard in language domain, no "rules"

"AI apps store"