Thursday, July 09, 2015

AVRO: fast data serialization

Welcome to Apache Avro!
Apache Avro™ is a data serialization system.

Apache Avro - Wikipedia, the free encyclopedia
"Avro is a remote procedure call and data serialization framework developed within Apache's Hadoop project. It usesJSON for defining data types and protocols, and serializes data in a compact binary format. Its primary use is in Apache Hadoop, where it can provide both a serialization format for persistent data, and a wire format for communication between Hadoop nodes, and from client programs to the Hadoop services.
It is similar to Thrift, but does not require running a code-generation program when a schema changes (unless desired forstatically-typed languages)"
State of the Art in Microservices @ InfoQ

Adrian Cockcroft on Microservices and DevOps @ InfoQ
"Inter-Service communication very important to understand and know the timings.
XML - 500 ms; JSON - 50 ms; Google Protocol Buffers ; Thrift; AVRO;
What is AVRO? Is the main process used by NetFlix for communication. Very fast."