Wednesday, September 04, 2024

OpenAI Batch API: 50% lower costs!

Batch - OpenAI API

how to use OpenAI's Batch API to send asynchronous groups of requests with 50% lower costs, a separate pool of significantly higher rate limits, and a clear 24-hour turnaround time. The service is ideal for processing jobs that don't require immediate responses.
  • Better cost efficiency: 50% cost discount compared to synchronous APIs
  • Higher rate limits: Substantially more headroom compared to the synchronous APIs
  • Fast completion times: Each batch completes within 24 hours (and often more quickly)

Pricing | OpenAI


Eval benchmark

ada v2

text-embedding-3-small

text-embedding-3-large

MIRACL average

31.4

44.0

54.9

MTEB average

61.0

62.3

64.6

book: Designing Data-Intensive Applications (DDIA)

Designing Data-Intensive Applications: The Big Ideas Behind Reliable, Scalable, and Maintainable Systems: Kleppmann, Martin: 9781449373320: Amazon.com: Books


Designing Data-Intensive Applications (DDIA) — an O’Reilly book by Martin Kleppmann (The Wild Boar Book)

Martin Kleppmann is a researcher in distributed systems at the University of Cambridge. Previously he was a software engineer and entrepreneur at Internet companies including LinkedIn and Rapportive,

Designing Data-Intensive Applications @ Oreilly Learning


ept/ddia-references: Literature references for “Designing Data-Intensive Applications” @GitHub

ddia-references/README.md at master · ept/ddia-references

chapters as islands!


My Thoughts on Designing Data-Intensive Applications | by Martin Mička | Medium


Thinking in Events: From Databases to Distributed Collaboration Software (ACM DEBS 2021) - YouTube

Apache Samza
A distributed stream processing framework (from LinkedIn)