Blame - doc/thrift.tex - packaging/sources/thrift

blob: 39b03838be5cac97327d970c4edc29861728b924 [file] [log] [blame]

Mark Slee	24b49d3	2007-03-21 01:24:00 +0000	[diff] [blame]	1	%-----------------------------------------------------------------------------
				2	%
				3	% Thrift whitepaper
				4	%
				5	% Name: thrift.tex
				6	%
				7	% Authors: Mark Slee (mcslee@facebook.com)
				8	%
				9	% Created: 05 March 2007
				10	%
				11	%-----------------------------------------------------------------------------
				12
				13
				14	\documentclass[nocopyrightspace,blockstyle]{sigplanconf}
				15
				16	\usepackage{amssymb}
				17	\usepackage{amsfonts}
				18	\usepackage{amsmath}
Marc Slemko	10b3bdb	2007-04-01 09:14:05 +0000	[diff] [blame]	19	\usepackage{url}
Mark Slee	24b49d3	2007-03-21 01:24:00 +0000	[diff] [blame]	20
				21	\begin{document}
				22
				23	% \conferenceinfo{WXYZ '05}{date, City.}
				24	% \copyrightyear{2007}
				25	% \copyrightdata{[to be supplied]}
				26
				27	% \titlebanner{banner above paper title} % These are ignored unless
				28	% \preprintfooter{short description of paper} % 'preprint' option specified.
				29
				30	\title{Thrift: Scalable Cross-Language Services Implementation}
				31	\subtitle{}
				32
				33	\authorinfo{Mark Slee, Aditya Agarwal and Marc Kwiatkowski}
				34	{Facebook, 156 University Ave, Palo Alto, CA}
				35	{\{mcslee,aditya,marc\}@facebook.com}
				36
				37	\maketitle
				38
				39	\begin{abstract}
				40	Thrift is a software library and set of code-generation tools developed at
				41	Facebook to expedite development and implementation of efficient and scalable
				42	backend services. Its primary goal is to enable efficient and reliable
				43	communication across programming languages by abstracting the portions of each
				44	language that tend to require the most customization into a common library
				45	that is implemented in each language. Specifically, Thrift allows developers to
				46	define data types and service interfaces in a single language-neutral file
				47	and generate all the necessary code to build RPC clients and servers.
				48
				49	This paper details the motivations and design choices we made in Thrift, as
				50	well as some of the more interesting implementation details. It is not
				51	intended to be taken as research, but rather it is an exposition on what we did
				52	and why.
				53	\end{abstract}
				54
				55	% \category{D.3.3}{Programming Languages}{Language constructs and features}
				56
				57	%\terms
				58	%Languages, serialization, remote procedure call
				59
				60	%\keywords
				61	%Data description language, interface definition language, remote procedure call
				62
				63	\section{Introduction}
				64	As Facebook's traffic and network structure have scaled, the resource
				65	demands of many operations on the site (i.e. search,
				66	ad selection and delivery, event logging) have presented technical requirements
				67	drastically outside the scope of the LAMP framework. In our implementation of
				68	these services, various programming languages have been selected to
				69	optimize for the right combination of performance, ease and speed of
				70	development, availability of existing libraries, etc. By and large,
				71	Facebook's engineering culture has tended towards choosing the best
				72	tools and implementations avaiable over standardizing on any one
				73	programming language and begrudgingly accepting its inherent limitations.
				74
				75	Given this design choice, we were presented with the challenge of building
				76	a transparent, high-performance bridge across many programming languages.
				77	We found that most available solutions were either too limited, did not offer
				78	sufficient data type freedom, or suffered from subpar performance.
				79	\footnote{See Appendix A for a discussion of alternative systems.}
				80
				81	The solution that we have implemented combines a language-neutral software
				82	stack implemented across numerous programming languages and an associated code
				83	generation engine that transforms a simple interface and data definition
				84	language into client and server remote procedure call libraries.
				85	Choosing static code generation over a dynamic system allows us to create
				86	validated code with implicit guarantees that can be run without the need for
				87	any advanced intropsecive run-time type checking. It is also designed to
				88	be as simple as possible for the developer, who can typically define all
				89	the necessary data structures and interfaces for a complex service in a single
				90	short file.
				91
				92	Surprised that a robust open solution to these relatively common problems
				93	did not yet exist, we committed early on to making the Thrift implementation
				94	open source.
				95
				96	In evaluating the challenges of cross-language interaction in a networked
				97	environment, some key components were identified:
				98
				99	\textit{Types.} A common type system must exist across programming languages
				100	without requiring that the application developer use custom Thrift data types
				101	or write their own serialization code. That is,
				102	a C++ programmer should be able to transparently exchange a strongly typed
				103	STL map for a dynamic Python dictionary. Neither
				104	programmer should be forced to write any code below the application layer
				105	to achieve this. Section 2 details the Thrift type system.
				106
				107	\textit{Transport.} Each language must have a common interface to
				108	bidirectional raw data transport. The specifics of how a given
				109	transport is implemented should not matter to the service developer.
				110	The same application code should be able to run against TCP stream sockets,
				111	raw data in memory, or files on disk. Section 3 details the Thrift Transport
				112	layer.
				113
				114	\textit{Protocol.} Data types must have some way of using the Transport
				115	layer to encode and decode themselves. Again, the application
				116	developer need not be concerned by this layer. Whether the service uses
				117	an XML or binary protocol is immaterial to the application code.
				118	All that matters is that the data can be read and written in a consistent,
				119	deterministic matter. Section 4 details the Thrift Protocol layer.
				120
				121	\textit{Versioning.} For robust services, the involved data types must
				122	provide a mechanism for versioning themselves. Specifically,
				123	it should be possible to add or remove fields in an object or alter the
				124	argument list of a function without any interruption in service (or,
				125	worse yet, nasty segmentation faults). Section 5 details Thrift's versioning
				126	system.
				127
				128	\textit{Processors.} Finally, we generate code capable of processing data
Aditya Agarwal	af524ee	2007-03-31 08:28:06 +0000	[diff] [blame]	129	streams to accomplish remote procedure calls. Section 6 details the generated
Mark Slee	24b49d3	2007-03-21 01:24:00 +0000	[diff] [blame]	130	code and TProcessor paradigm.
				131
				132	Section 7 discusses implementation details, and Section 8 describes
				133	our conclusions.
				134
				135	\section{Types}
				136
				137	The goal of the Thrift type system is to enable programmers to develop using
				138	completely natively defined types, no matter what programming language they
				139	use. By design, the Thrift type system does not introduce any special dynamic
				140	types or wrapper objects. It also does not require that the developer write
				141	any code for object serialization or transport. The Thrift IDL file is
				142	logically a way for developers to annotate their data structures with the
				143	minimal amount of extra information necessary to tell a code generator
				144	how to safely transport the objects across languages.
				145
				146	\subsection{Base Types}
				147
				148	The type system rests upon a few base types. In considering which types to
				149	support, we aimed for clarity and simplicity over abundance, focusing
				150	on the key types available in all programming languages, ommitting any
				151	niche types available only in specific languages.
				152
				153	The base types supported by Thrift are:
				154	\begin{itemize}
				155	\item \texttt{bool} A boolean value, true or false
				156	\item \texttt{byte} A signed byte
				157	\item \texttt{i16} A 16-bit signed integer
				158	\item \texttt{i32} A 32-bit signed integer
				159	\item \texttt{i64} A 64-bit signed integer
				160	\item \texttt{double} A 64-bit floating point number
				161	\item \texttt{string} An encoding-agnostic text or binary string
				162	\end{itemize}
				163
				164	Of particular note is the absence of unsigned integer types. Because these
				165	types have no direct translation to native primitive types in many languages,
				166	the advantages they afford are lost. Further, there is no way to prevent the
				167	application developer in a language like Python from assigning a negative value
				168	to an integer variable, leading to unpredictable behavior. From a design
				169	standpoint, we observed that unsigned integers were very rarely, if ever, used
				170	for arithmetic purposes, but in practice were much more often used as keys or
				171	identifiers. In this case, the sign is irrelevant. Signed integers serve this
				172	same purpose and can be safely cast to their unsigned counterparts (most
				173	commonly in C++) when absolutely necessary.
				174
				175	\subsection{Containers}
				176
				177	Thrift containers are strongly typed containers that map to the most commonly
				178	used containers in common programming languages. They are annotated using
				179	C++ template (or Java Generics) style. There are three types available:
				180	\begin{itemize}
				181	\item \texttt{list<type>} An ordered list of elements. Translates directly into
				182	an STL vector, Java ArrayList, or native array in scripting languages. May
				183	contain duplicates.
				184	\item \texttt{set<type>} An unordered set of unique elements. Translates into
Aditya Agarwal	af524ee	2007-03-31 08:28:06 +0000	[diff] [blame]	185	an STL set, Java HashSet, or native dictionary in PHP/Python/Ruby.
Mark Slee	24b49d3	2007-03-21 01:24:00 +0000	[diff] [blame]	186	\item \texttt{map<type1,type2>} A map of strictly unique keys to values
				187	Translates into an STL map, Java HashMap, PHP associative array,
				188	or Python/Ruby dictionary.
				189	\end{itemize}
				190
				191	While defaults are provided, the type mappings are not explicitly fixed. Custom
				192	code generator directives have been added to substitute custom types in
				193	destination languages (i.e.
Aditya Agarwal	af524ee	2007-03-31 08:28:06 +0000	[diff] [blame]	194	\texttt{hash\_map} or Google's sparse hash map can be used in C++). The
Mark Slee	24b49d3	2007-03-21 01:24:00 +0000	[diff] [blame]	195	only requirement is that the custom types support all the necessary iteration
				196	primitives. Container elements may be of any valid Thrift type, including other
				197	containers or structs.
				198
				199	\subsection{Structs}
				200
Aditya Agarwal	af524ee	2007-03-31 08:28:06 +0000	[diff] [blame]	201	A Thrift struct defines a common object to be used across languages. A struct
Mark Slee	24b49d3	2007-03-21 01:24:00 +0000	[diff] [blame]	202	is essentially equivalent to a class in object oriented programming
				203	languages. A struct has a set of strongly typed fields, each with a unique
				204	name identifier. The basic syntax for defining a Thrift struct looks very
				205	similar to a C struct definition. Fields may be annotated with an integer field
				206	identifier (unique to the scope of that struct) and optional default values.
				207	Field identifiers will be automatically assigned if omitted, though they are
				208	strongly encouraged for versioning reasons discussed later.
				209
				210	\begin{verbatim}
				211	struct Example {
				212	1:i32 number=10,
				213	2:i64 bigNumber,
				214	3:double decimals,
				215	4:string name="thrifty"
				216	}\end{verbatim}
				217
				218	In the target language, each definition generates a type with two methods,
				219	\texttt{read} and \texttt{write}, which perform serialization and transport
				220	of the objects using a Thrift TProtocol object.
				221
				222	\subsection{Exceptions}
				223
				224	Exceptions are syntactically and functionally equivalent to structs except
				225	that they are declared using the \texttt{exception} keyword instead of the
				226	\texttt{struct} keyword.
				227
				228	The generated objects inherit from an exception base class as appropriate
				229	in each target programming language, the goal being to offer seamless
				230	integration with native exception handling for the developer in any given
				231	language. Again, the design emphasis is on making the code familiar to the
				232	application developer.
				233
				234	\subsection{Services}
				235
				236	Services are defined using Thrift types. Definition of a service is
				237	semantically equivalent to defining a pure virtual interface in object oriented
				238	programming. The Thrift compiler generates fully functional client and
				239	server stubs that implement the interface. Services are defined as follows:
				240
				241	\begin{verbatim}
				242	service <name> {
				243	<returntype> <name>(<arguments>)
				244	[throws (<exceptions>)]
				245	...
				246	}\end{verbatim}
				247
				248	An example:
				249
				250	\begin{verbatim}
				251	service StringCache {
				252	void set(1:i32 key, 2:string value),
				253	string get(1:i32 key) throws (1:KeyNotFound knf),
				254	void delete(1:i32 key)
				255	}
				256	\end{verbatim}
				257
				258	Note that \texttt{void} is a valid type for a function return, in addition to
				259	all other defined Thrift types. Additionally, an \texttt{async} modifier
				260	keyword may be added to a void function, which will generate code that does
				261	not wait for a response from the server. Note that a pure \texttt{void}
				262	function will return a response to the client which guarantees that the
				263	operation has completed on the server side. With \texttt{async} method calls
				264	the client can only be guaranteed that the request succeeded at the
				265	transport layer. (In many transport scenarios this is inherently unreliable
				266	due to the Byzantine Generals' Problem. Therefore, application developers
				267	should take care only to use the async optimization in cases where dopped
				268	method calls are acceptable or the transport is known to be reliable.)
				269
				270	Also of note is the fact that argument and exception lists to functions are
				271	implemented as Thrift structs. They are identical in both notation and
				272	behavior.
				273
				274	\section{Transport}
				275
				276	The transport layer is used by the generated code to facilitate data transfer.
				277
				278	\subsection{Interface}
				279
				280	A key design choice in the implementation of Thrift was to abstract the
				281	transport layer from the code generation layer. Though Thrift is typically
				282	used on top of the TCP/IP stack with streaming sockets as the base layer of
				283	communication, there was no compelling reason to build that constraint into
				284	the system. The performance tradeoff incurred by an abstracted I/O layer
				285	(roughly one virtual method lookup / function call per operation) was
				286	immaterial compared to the cost of actual I/O operations (typically invoking
				287	system calls).
				288
Aditya Agarwal	af524ee	2007-03-31 08:28:06 +0000	[diff] [blame]	289	Fundamentally, generated Thrift code only needs to know how to read and
Mark Slee	24b49d3	2007-03-21 01:24:00 +0000	[diff] [blame]	290	write data. Where the data is going is irrelevant, it may be a socket, a
				291	segment of shared memory, or a file on the local disk. The Thrift transport
				292	interface supports the following methods.
				293
				294	\begin{itemize}
				295	\item \texttt{open()} Opens the tranpsort
				296	\item \texttt{close()} Closes the tranport
				297	\item \texttt{isOpen()} Whether the transport is open
				298	\item \texttt{read()} Reads from the transport
				299	\item \texttt{write()} Writes to the transport
				300	\item \texttt{flush()} Force any pending writes
				301	\end{itemize}
				302
				303	There are a few additional methods not documented here which are used to aid
				304	in batching reads and optionally signaling completion of reading or writing
				305	chunks of data by the generated code.
				306
				307	In addition to the above
				308	\texttt{TTransport} interface, there is a \texttt{TServerTransport} interface
				309	used to accept or create primitive transport objects. Its interface is as
				310	follows:
				311
				312	\begin{itemize}
				313	\item \texttt{open()} Opens the tranpsort
				314	\item \texttt{listen()} Begins listening for connections
				315	\item \texttt{accept()} Returns a new client transport
				316	\item \texttt{close()} Closes the transport
				317
				318	\end{itemize}
				319
				320	\subsection{Implementation}
				321
				322	The transport interface is designed for simple implementation in any
				323	programming language. New transport mechanisms can be easily defined as needed
				324	by application developers.
				325
				326	\subsubsection{TSocket}
				327
				328	The \texttt{TSocket} class is implemented across all target languages. It
				329	provides a common, simple interface to a TCP/IP stream socket.
				330
				331	\subsubsection{TFileTransport}
				332
				333	The \texttt{TFileTransport} is an abstraction of an on-disk file to a data
Aditya Agarwal	af524ee	2007-03-31 08:28:06 +0000	[diff] [blame]	334	stream. It can be used to write out a set of incoming thrift request to a file
				335	on disk. The on-disk data can then be replayed from the log, either for post-processing
				336	or for recreation and simulation of past events. \texttt(TFileTransport).
Mark Slee	24b49d3	2007-03-21 01:24:00 +0000	[diff] [blame]	337
				338	\subsubsection{Utilities}
				339
				340	The Transport interface is designed to support easy extension using common
				341	OOP techniques such as composition. Some simple utilites include the
				342	\texttt{TBufferedTransport}, which buffers writes and reads on an underlying
				343	transport, the \texttt{TFramedTransport}, which transmits data with frame
				344	size headers for chunking optimzation or nonblocking operation, and the
				345	\texttt{TMemoryBuffer}, which allows reading and writing directly from heap or
				346	stack memory owned by the process.
				347
				348	\section{Protocol}
				349
				350	A second major abstraction in Thrift is the separation of data structure from
				351	transport representation. Thrift enforces a certain messaging structure when
				352	transporting data, but it is agnostic to the protocol encoding in use. That is,
				353	it does not matter whether data is encoded in XML, human-readable ASCII, or a
				354	dense binary format, so long as the data supports a fixed set of operations
				355	that allow generated code to deterministically read and write.
				356
				357	\subsection{Interface}
				358
				359	The Thrift Protocol interface is very straightforward. It fundamentally
				360	supports two things: 1) bidirectional sequenced messaging, and
				361	2) encoding of base types, containers, and structs.
				362
				363	\begin{verbatim}
				364	writeMessageBegin(name, type, seq)
				365	writeMessageEnd()
				366	writeStructBegin(name)
				367	writeStructEnd()
				368	writeFieldBegin(name, type, id)
				369	writeFieldEnd()
				370	writeFieldStop()
				371	writeMapBegin(ktype, vtype, size)
				372	writeMapEnd()
				373	writeListBegin(etype, size)
				374	writeListEnd()
				375	writeSetBegin(etype, size)
				376	writeSetEnd()
				377	writeBool(bool)
				378	writeByte(byte)
				379	writeI16(i16)
				380	writeI32(i32)
				381	writeI64(i64)
				382	writeDouble(double)
				383	writeString(string)
				384
				385	name, type, seq = readMessageBegin()
				386	readMessageEnd()
				387	name = readStructBegin()
				388	readStructEnd()
				389	name, type, id = readFieldBegin()
				390	readFieldEnd()
				391	k, v, size = readMapBegin()
				392	readMapEnd()
				393	etype, size = readListBegin()
				394	readListEnd()
				395	etype, size = readSetBegin()
				396	readSetEnd()
				397	bool = readBool()
				398	byte = readByte()
				399	i16 = readI16()
				400	i32 = readI32()
				401	i64 = readI64()
				402	double = readDouble()
				403	string = readString()
				404	\end{verbatim}
				405
				406	Note that every write function has exactly one read function counterpart, with
				407	the exception of the \texttt{writeFieldStop()} method. This is a special method
				408	that signals the end of a struct. The procedure for reading a struct is to
				409	\texttt{readFieldBegin()} until the stop field is encountered, and to then
				410	\texttt{readStructEnd()}. The
				411	generated code relies upon this structure to ensure that everything written by
				412	a protocol encoder can be read by a matching protocol decoder. Further note
				413	that this set of functions is by design more robust than necessary.
				414	For example, \texttt{writeStructEnd()} is not strictly necessary, as the end of
				415	a struct may be implied by the stop field. This method is a convenience for
				416	verbose protocols where it is cleaner to separate these calls (i.e. a closing
				417	\texttt{</struct>} tag in XML).
				418
				419	\subsection{Structure}
				420
				421	Thrift structures are designed to support encoding into a streaming
				422	protocol. That is, the implementation should never need to frame or compute the
				423	entire data length of a structure prior to encoding it. This is critical to
				424	performance in many scenarios. Consider a long list of relatively large
				425	strings. If the protocol interface required reading or writing a list as an
				426	atomic operation, then the implementation would require a linear pass over the
				427	entire list before encoding any data. However, if the list can be written
				428	as iteration is performed, the corresponding read may begin in parallel,
Aditya Agarwal	af524ee	2007-03-31 08:28:06 +0000	[diff] [blame]	429	theoretically offering an end-to-end speedup of $(kN - C)$, where $N$ is the size
Mark Slee	24b49d3	2007-03-21 01:24:00 +0000	[diff] [blame]	430	of the list, $k$ the cost factor associated with serializing a single
				431	element, and $C$ is fixed offset for the delay between data being written
				432	and becoming available to read.
				433
				434	Similarly, structs do not encode their data lengths a priori. Instead, they are
				435	encoded as a sequence of fields, with each field having a type specifier and a
				436	unique field identifier. Note that the inclusion of type specifiers enables
				437	the protocol to be safely parsed and decoded without any generated code
				438	or access to the original IDL file. Structs are terminated by a field header
				439	with a special \texttt{STOP} type. Because all the basic types can be read
				440	deterministically, all structs (including those with nested structs) can be
				441	read deterministically. The Thrift protocol is self-delimiting without any
				442	framing and regardless of the encoding format.
				443
				444	In situations where streaming is unnecessary or framing is advantageous, it
				445	can be very simply added into the transport layer, using the
				446	\texttt{TFramedTransport} abstraction.
				447
				448	\subsection{Implementation}
				449
				450	Facebook has implemented and deployed a space-efficient binary protocol which
				451	is used by most backend services. Essentially, it writes all data
				452	in a flat binary format. Integer types are converted to network byte order,
				453	strings are prepended with their byte length, and all message and field headers
				454	are written using the primitive integer serialization constructs. String names
				455	for fields are omitted - when using generated code, field identifiers are
				456	sufficient.
				457
				458	We decided against some extreme storage optimizations (i.e. packing
				459	small integers into ASCII or using a 7-bit continuation format) for the sake
				460	of simplicity and clarity in the code. These alterations can easily be made
				461	if and when we encounter a performance critical use case that demands them.
				462
				463	\section{Versioning}
				464
				465	Thrift is robust in the face of versioning and data definition changes. This
				466	is critical to enable a staged rollout of changes to deployed services. The
				467	system must be able to support reading of old data from logfiles, as well as
				468	requests from out of date clients to new servers, or vice versa.
				469
				470	\subsection{Field Identifiers}
				471
				472	Versioning in Thrift is implemented via field identifiers. The field header
				473	for every member of a struct in Thrift is encoded with a unique field
				474	identifier. The combination of this field identifier and its type specifier
				475	is used to uniquely identify the field. The Thrift definition language
				476	supports automatic assignment of field identifiers, but it is good
				477	programming practice to always explicitly specify field identifiers.
				478	Identifiers are specified as follows:
				479
				480	\begin{verbatim}
				481	struct Example {
				482	1:i32 number=10,
				483	2:i64 bigNumber,
				484	3:double decimals,
				485	4:string name="thrifty"
				486	}\end{verbatim}
				487
				488	To avoid conflicts, fields with omitted identifiers are automatically assigned
				489	decrementing from -1, and the language only supports the manual assignment of
				490	positive identifiers.
				491
				492	When data is being deserialized, the generated code can use these identifiers
				493	to properly identify the field and determine whether it aligns with a field in
				494	its definition file. If a field identifier is not recognized, the generated
				495	code can use the type specifier to skip the unknown field without any error.
				496	Again, this is possible due to the fact that all data types are self
				497	delimiting.
				498
				499	Field identifiers can (and should) also be specified in function argument
				500	lists. In fact, argument lists are not only represented as structs on the
				501	backend, but actually share the same code in the compiler frontend. This
				502	allows for version-safe modification of method parameters
				503
				504	\begin{verbatim}
				505	service StringCache {
				506	void set(1:i32 key, 2:string value),
				507	string get(1:i32 key) throws (1:KeyNotFound knf),
				508	void delete(1:i32 key)
				509	}
				510	\end{verbatim}
				511
				512	The syntax for specifying field identifiers was chosen to echo their structure.
				513	Structs can be thought of as a dictionary where the identifiers are keys, and
				514	the values are strongly typed, named fields.
				515
				516	Field identifiers internally use the \texttt{i16} Thrift type. Note, however,
				517	that the \texttt{TProtocol} abstraction may encode identifiers in any format.
				518
				519	\subsection{Isset}
				520
				521	When an unexpected field is encountered, it can be safely ignored and
				522	discarded. When an expected field is not found, there must be some way to
				523	signal to the developer that it was not present. This is implemented via an
				524	inner \texttt{isset} structure inside the defined objects. (In PHP, this is
				525	implicit with a \texttt{null} value, or \texttt{None} in Python
				526	and \texttt{nil} in Ruby.) Essentially,
				527	the inner \texttt{isset} object of each Thrift struct contains a boolean value
				528	for each field which denotes whether or not that field is present in the
				529	struct. When a reader receives a struct, it should check for a field being set
				530	before operating directly on it.
				531
				532	\begin{verbatim}
				533	class Example {
				534	public:
				535	Example() :
				536	number(10),
				537	bigNumber(0),
				538	decimals(0),
				539	name("thrifty") {}
				540
				541	int32_t number;
				542	int64_t bigNumber;
				543	double decimals;
				544	std::string name;
				545
				546	struct __isset {
				547	__isset() :
				548	number(false),
				549	bigNumber(false),
				550	decimals(false),
				551	name(false) {}
				552	bool number;
				553	bool bigNumber;
				554	bool decimals;
				555	bool name;
				556	} __isset;
				557	...
				558	}
				559	\end{verbatim}
				560
				561	\subsection{Case Analysis}
				562
				563	There are four cases in which version mismatches may occur.
				564
				565	\begin{enumerate}
				566	\item \textit{Added field, old client, new server.} In this case, the old
				567	client does not send the new field. The new server recognizes that the field
				568	is not set, and implements default behavior for out of date requests.
				569	\item \textit{Removed field, old client, new server.} In this case, the old
				570	client sends the removed field. The new server simply ignores it.
				571	\item \textit{Added field, new client, old server.} The new client sends a
				572	field that the old server does not recognize. The old server simply ignores
				573	it and processes as normal.
				574	\item \textit{Removed field, new client, old server.} This is the most
				575	dangerous case, as the old server is unlikely to have suitable default
				576	behavior implemented for the missing field. It is recommended that in this
				577	situation the new server be rolled out prior to the new clients.
				578	\end{enumerate}
				579
				580	\subsection{Protocol/Transport Versioning}
				581	The \texttt{TProtocol} abstractions are also designed to give protocol
				582	implementations the freedom to version themselves in whatever manner they
				583	see fit. Specifically, any protocol implementation is free to send whatever
				584	it likes in the \texttt{writeMessageBegin()} call. It is entirely up to the
				585	implementor how to handle versioning at the protocol level. The key point is
				586	that protocol encoding changes are safely isolated from interface definition
				587	version changes.
				588
				589	Note that the exact same is true of the \texttt{TTransport} interface. For
				590	example, if we wished to add some new checksumming or error detection to the
				591	\texttt{TFileTransport}, we could simply add a version header into the
				592	data it writes to the file in such a way that it would still accept old
				593	logfiles without the given header.
				594
				595	\section{RPC Implementation}
				596
				597	\subsection{TProcessor}
				598
				599	The last core interface in the Thrift design is the \texttt{TProcessor},
				600	perhaps the most simple of the constructs. The interface is as follows:
				601
				602	\begin{verbatim}
				603	interface TProcessor {
				604	bool process(TProtocol in, TProtocol out)
				605	throws TException
				606	}
				607	\end{verbatim}
				608
				609	The key design idea here is that the complex systems we build can fundamentally
				610	be broken down into agents or services that operate on inputs and outputs. In
				611	most cases, there is actually just one input and output (an RPC client) that
				612	needs handling.
				613
				614	\subsection{Generated Code}
				615
				616	When a service is defined, we generate a
				617	\texttt{TProcessor} instance capable of handling RPC requests to that service,
				618	using a few helpers. The fundamental structure (illustrated in pseudo-C++) is
				619	as follows:
				620
				621	\begin{verbatim}
				622	Service.thrift
				623	=> Service.cpp
				624	interface ServiceIf
				625	class ServiceClient : virtual ServiceIf
				626	TProtocol in
				627	TProtocol out
				628	class ServiceProcessor : TProcessor
				629	ServiceIf handler
				630
				631	ServiceHandler.cpp
				632	class ServiceHandler : virtual ServiceIf
				633
				634	TServer.cpp
				635	TServer(TProcessor processor,
				636	TServerTransport transport,
				637	TTransportFactory tfactory,
				638	TProtocolFactory pfactory)
				639	serve()
				640	\end{verbatim}
				641
				642	From the thrift definition file, we generate the virtual service interface.
				643	A client class is generated, which implements the interface and
				644	uses two \texttt{TProtocol} instances to perform the I/O operations. The
				645	generated processor implements the \texttt{TProcessor} interface. The generated
				646	code has all the logic to handle RPC invocations via the \texttt{process()}
				647	call, and takes as a parameter an instance of the service interface,
				648	implemented by the application developer.
				649
				650	The user provides an implementation of the application interface in their own,
				651	non-generated source file.
				652
				653	\subsection{TServer}
				654
				655	Finally, the Thrift core libraries provide a \texttt{TServer} abstraction.
				656	The \texttt{TServer} object generally works as follows.
				657
				658	\begin{itemize}
				659	\item Use the \texttt{TServerTransport} to get a \texttt{TTransport}
				660	\item Use the \texttt{TTransportFactory} to optionally convert the primitive
				661	transport into a suitable application transport (typically the
				662	\texttt{TBufferedTransportFactory} is used here)
				663	\item Use the \texttt{TProtocolFactory} to create an input and output protocol
				664	for the \texttt{TTransport}
				665	\item Invoke the \texttt{process()} method of the \texttt{TProcessor} object
				666	\end{itemize}
				667
				668	The layers are appropriately separated such that the server code needs to know
				669	nothing about any of the transports, encodings, or applications in play. The
				670	server encapsulates the logic around connection handling, threading, etc.
				671	while the processor deals with RPC. The only code written by the application
				672	developer lives in the definitional thrift file and the interface
				673	implementation.
				674
				675	Facebook has deployed multiple \texttt{TServer} implementations, including
				676	the single-threaded \texttt{TSimpleServer}, thread-per-connection
				677	\texttt{TThreadedServer}, and thread-pooling \texttt{TThreadPoolServer}.
				678
				679	The \texttt{TProcessor} interface is very general by design. There is no
				680	requirement that a \texttt{TServer} take a generated \texttt{TProcessor}
				681	object. Thrift allows the application developer to easily write any type of
				682	server that operates on \texttt{TProtocol} objects (for instance, a server
				683	could simply stream a certain type of object without any actual RPC method
				684	invocation).
				685
				686	\section{Implementation Details}
				687	\subsection{Target Languages}
				688	Thrift currently supports five target languages: C++, Java, Python, Ruby, and
				689	PHP. At Facebook, we have deployed servers predominantly in C++, Java, and
				690	Python. Thrift services implemented in PHP have also been embedded into the
				691	Apache web server, providing transparent backend access to many of our
				692	frontend constructs using a \texttt{THttpClient} implementation of the
				693	\texttt{TTransport} interface.
				694
				695	Though Thrift was explicitly designed to be much more efficient and robust
				696	than typical web technologies, as we were designing our XML-based REST web
				697	services API we noticed that Thrift could be easily used to define our
				698	service interface. Though we do not currently employ SOAP envelopes (in the
				699	author's opinion there is already far too much repetetive enterprise Java
				700	software to do that sort of thing), we were able to quickly extend Thrift to
				701	generate XML Schema Definition files for our service, as well as a framework
				702	for versioning different implementations of our web service. Though public
				703	web services are admittedly tangential to Thrift's core use case and design,
				704	Thrift facilitated rapid iteration and affords us the ability to quickly
				705	migrate our entire XML-based web service onto a higher performance system
				706	should the future need arise.
				707
				708	\subsection{Generated Structs}
				709	We made a conscious decision to make our generated structs as transparent as
				710	possible. All fields are publicly accessible; there are no \texttt{set()} and
				711	\texttt{get()} methods. Similarly, use of the \texttt{isset} object is not
				712	enforced. We do not include any \texttt{FieldNotSetException} construct.
				713	Developers have the option to use these fields to write more robust code, but
				714	the system is robust to the developer ignoring the \texttt{isset} construct
				715	entirely and will provide suitable default behavior in all cases.
				716
				717	The reason for this choice was for ease of application development. Our stated
				718	goal is not to make developers learn a rich new library in their language of
				719	choice, but rather to generate code that allow them to work with the constructs
				720	that are most familiar in each language.
				721
				722	We also made the \texttt{read()} and \texttt{write()} methods of the generated
				723	objects public members so that the objects can be used outside of the context
				724	of RPC clients and servers. Thrift is a useful tool simply for generating
				725	objects that are easily serializable across programming languages.
				726
				727	\subsection{RPC Method Identification}
				728	Method calls in RPC are implemented by sending the method name as a string. One
				729	issue with this approach is that longer method names require more bandwidth.
				730	We experimented with using fixed-size hashes to identify methods, but in the
				731	end concluded that the savings were not worth the headaches incurred. Reliably
				732	dealing with conflicts across versions of an interface definition file is
				733	impossible without a meta-storage system (i.e. to generate non-conflicting
				734	hashes for the current version of a file, we would have to know about all
				735	conflicts that ever existed in any previous version of the file).
				736
				737	We wanted to avoid too many unnecessary string comparisons upon
				738	method invocation. To deal with this, we generate maps from strings to function
				739	pointers, so that invocation is effectively accomplished via a constant-time
				740	hash lookup in the common case. This requires the use of a couple interesting
				741	code constructs. Because Java does not have function pointers, process
				742	functions are all private member classes implementing a common interface.
				743
				744	\begin{verbatim}
				745	private class ping implements ProcessFunction {
				746	public void process(int seqid,
				747	TProtocol iprot,
				748	TProtocol oprot)
				749	throws TException
				750	{ ...}
				751	}
				752
				753	HashMap<String,ProcessFunction> processMap_ =
				754	new HashMap<String,ProcessFunction>();
				755	\end{verbatim}
				756
				757	In C++, we use a relatively esoteric language construct: member function
				758	pointers.
				759
				760	\begin{verbatim}
				761	std::map<std::string,
				762	void (ExampleServiceProcessor::*)(int32_t,
				763	facebook::thrift::protocol::TProtocol*,
				764	facebook::thrift::protocol::TProtocol*)>
				765	processMap_;
				766	\end{verbatim}
				767
				768	Using these techniques, the cost of string processing is minimized, and we
				769	reap the benefit of being able to easily debug corrupt or misunderstood data by
				770	looking for string contents.
				771
				772	\subsection{Servers and Multithreading}
Marc Slemko	10b3bdb	2007-04-01 09:14:05 +0000	[diff] [blame]	773	Thrift services require basic multithreading services to handle simultaneous
				774	requests from multiple clients. For the python and java implementations of
				775	thrift server logic, the multi-thread support provided by those runtimes was more
				776	than adequate. For the C++ implementation no standard multithread runtime
				777	library support exists. Specifically a robust, lightweight, and portable
				778	thread manager and timer class implementation do not exist. We investigated
				779	existing implementations, namely {\tt boost::thread},
				780	{\tt boost::threadpool}, {\tt ACE\_Thread\_Manager} and {\tt ACE\_Timer}.
				781
				782	While {\tt boost::threads \cite{boost.threads} } provides clean, lightweight and
				783	robust implementations of multi-thread primitives (mutexes, conditions, threads)
				784	it does not provide a thread manager or timer implementation.
				785
				786	{\tt boost::threadpool \cite{boost.threadpool} } also looked promising but was not
				787	far enough along for our purposes. We wanted to limit the dependency on
				788	thirdparty libraries as much as possible. Because {\tt boost::threadpool} is not
				789	a pure template library and requires runtime libraries and because it is not yet
				790	part of the official boost distribution we felt it was not ready for use in thrift.
				791	As {\tt boost::threadpool} evolves and especially if it is added to the boost
				792	distribution we may reconsider our decision not to use it.
				793
				794	ACE has both a thread manager and timer class in addition to multi-thread
				795	primitives. The biggest problem with ACE is that it is ACE. Unlike boost, ACE
				796	API quality is poor. Everything in ACE has large numbers of dependencies on
				797	everything else in ACE - thus forcing developers to throw out standard classes,
				798	like STL collection is favor of ACE's homebrewed implementations. In addition,
				799	unlike boost, ACE implementations demonstrate little understanding of the power
				800	and pitfalls of C++ programming and take no advantage of modern templating
				801	techniques to ensure compile time safety and reasonable compiler error messages.
				802	For all these reasons, ACE was rejected.
				803
				804	\subsection{Thread Primitives}
				805
				806	The thrift thread libraries have three components
				807	\begin{itemize}
				808	\item \texttt{primitives}
				809	\item \texttt{thread pool manager}
				810	\item \texttt{timer manager}
				811	\end{itemize}
				812
				813	As mentioned above, we were hesitant to introduce any additional dependencies on
				814	thrift. We decided to use {\tt boost::shared\_ptr} because it is so useful for
				815	multithreaded application, because it requires no link-time or runtime libraries
				816	(ie it is a pure template library) and because it is become part of the C++0X
				817	standard.
				818
				819	We implement standard {\tt Mutex} and {\tt Condition} classes, and a
				820	{\tt Monitor} class. The latter is simply a combination of a mutex and
				821	condition variable and is analogous to the monitor implementation provided for
				822	all objects in java. This is also sometimes referred to as a barrier. We
				823	provide a {\tt Synchronized} guard class to allow java-like synchronized blocks.
				824	This is just a bit of syntactic sugar, but, like its java counterpart, clearly
				825	delimits critical sections of code. Unlike it's java counterpart, we still have
				826	the ability to programmatically lock, unlock, block, and signal monitors.
				827
				828	\begin{verbatim}
				829	void run() {
				830	{Synchronized s(manager->monitor);
				831	if (manager->state == TimerManager::STARTING) {
				832	manager->state = TimerManager::STARTED;
				833	manager->monitor.notifyAll();
				834	}
				835	}
				836	}
				837	\end{verbatim}
				838
				839	We again borrowed from java the distinction between a thread and a runnable
				840	class. A {\tt facebook::thread:Thread} is the actual schedulable object. The
				841	{\tt facebook::thread::Runnable} is the logic to execute within the thread.
				842	The {\tt Thread} implementation deals with all the platform-specific thread
				843	creation and destruction issues, while the {tt Runnable} implementation deals
				844	with the application-specific per-thread logic. . The benefit of this approach
				845	is that developers can easily subclass the Runnable class without pulling in
				846	platform-specific super-clases.
				847
				848	\subsection{Thread, Runnable, and shared\_ptr}
				849	We use {\tt boost::shared\_ptr} throughout the {\tt ThreadManager} and
				850	{\tt TimerManager} implementations to guarantee cleanup of dead objects that can
				851	be accessed by multiple threads. For {\tt Thread} class implementations,
				852	{\tt boost::shared\_ptr} usage requires particular attention to make sure
				853	{\tt Thread} objects are neither leaked nor dereferenced prematurely while
				854	creating and shutting down threads.
				855
				856	Thread creation requires calling into a C library. (In our case the POSIX
				857	thread library, libhthread, but the same would be true for WIN32 threads).
				858	Typically, the OS makes few if any guarantees about when a C thread's
				859	entry-point function, {\tt ThreadMain} will be called. Therefore, it is
				860	possible that our thread create call,
				861	{\tt facebook::thread::ThreadFactory::newThread()} could return to the caller
				862	well before that time. To ensure that the returned {\tt Thread} object is not
				863	prematurely cleaned up if the caller gives up its reference prior to the
				864	{\tt ThreadMain} call, the {\tt Thread} object makes a weak referenence to
				865	itself in its {\tt start} method.
				866
				867	With the weak reference in hand the {\tt ThreadMain} function can attempt to get
				868	a strong reference before entering the {\tt Runnable::run} method of the
				869	{\tt Runnable} object bound to the {\tt Thread}. If no strong refereneces to the
				870	thread obtained between exiting {\tt Thread::start} and entering the C helper
				871	function, {\tt ThreadMain}, the weak reference returns null and the function
				872	exits immediately.
				873
				874	The need for the {\tt Thread} to make a weak reference to itself has a
				875	significant impact on the API. Since references are managed through the
				876	{\tt boost::shared\_ptr} templates, the {\tt Thread} object must have a reference
				877	to itself wrapped by the same {\tt boost::shared\_ptr} envelope that is returned
				878	to the caller. This necessitated use of the factory pattern.
				879	{\tt ThreadFactory} creates the raw {\tt Thread} object and
				880	{tt boost::shared\_ptr} wrapper, and calls a private helper method of the class
				881	implementing the {\tt Thread} interface (in this case, {\tt PosixThread::weakRef}
				882	to allow it to make add weak reference to itself through the
				883	{\tt boost::shared\_ptr} envelope.
				884
				885	{\tt Thread} and {\tt Runnable} objects reference each other. A {\tt Runnable}
				886	object may need to know which thread it is executing in and a Thread, obviously,
				887	needs to know what {\tt Runnable} object it is hosting. This interdependency is
				888	further complicated because the lifecycle of each object is independent of the
				889	other. An application may create a set of {\tt Runnable} object to be used overs
				890	and over in different threads, or it may create and forget a {\tt Runnable} object
				891	once a thread has been created and started for it.
				892
				893	The {\tt Thread} class takes a {\tt boost::shared\_ptr} reference to the hosted
				894	{\tt Runnable} object in its contructor, while the {\tt Runnable} class has an
				895	explicit {\tt thread} method to allow explicit binding of the hosted thread.
				896	{\tt ThreadFactory::newThread} binds the two objects to each other.
				897
				898	\subsection{ThreadManager}
				899
				900	{\tt facebook::thread::ThreadManager} creates a pool of worker threads and
				901	allows applications to schedule tasks for execution as free worker threads
				902	become available. The {\tt ThreadManager} does not implement dynamic
				903	thread pool resizing, but provides primitives so that applications can add
				904	and remove threads based on load. This approach was chosen because
				905	implementing load metrics and thread pool size is very application
				906	specific. For example some applications may want to adjust pool size based
				907	on running-average of work arrival rates that are measured via polled
				908	samples. Others may simply wish to react immediately to work-queue
				909	depth high and low water marks. Rather than trying to create a complex
				910	API that is abstract enough to capture these different approaches, we
				911	simply leave it up to the particular application and provide the
				912	primitives to enact the desired policy and sample current status.
				913
				914	\subsection{TimerManager}
				915
				916	{\tt facebook::thread::TimerManager} applows applications to schedule
				917	{\tt Runnable} object execution at some point in the future. Its specific task
				918	is to allows applications to sample {\tt ThreadManager} load at regular
				919	intervals and make changes to the thread pool size based on application policy.
				920	Of course, it can be used to generate any number of timer or alarm events.
				921
				922	The default implementation of {\tt TimerManager} uses a single thread to
				923	execute expired {\tt Runnable} objects. Thus, if a timer operation needs to
				924	do a large amount of work and especially if it needs to do blocking I/O,
				925	that should be done in a separate thread.
Mark Slee	24b49d3	2007-03-21 01:24:00 +0000	[diff] [blame]	926
				927	\subsection{Nonblocking Operation}
				928	Though the Thrift transport interfaces map more directly to a blocking I/O
				929	model, we have implemented a high performance \texttt{TNonBlockingServer}
				930	in C++ based upon \texttt{libevent} and the \texttt{TFramedTransport}. We
				931	implemented this by moving all I/O into one tight event loop using a
				932	state machine. Essentially, the event loop reads framed requests into
				933	\texttt{TMemoryBuffer} objects. Once entire requests are ready, they are
				934	dispatched to the \texttt{TProcessor} object which can read directly from
				935	the data in memory.
				936
				937	\subsection{Compiler}
				938	The Thrift compiler is implemented in C++ using standard lex/yacc style
				939	tokenization and parsing. Though it could have been implemented with fewer
				940	lines of code in another language (i.e. Python/PLY or ocamlyacc), using C++
				941	forces explicit definition of the language constructs. Strongly typing the
				942	parse tree elements (debatably) makes the code more approachable for new
				943	developers.
				944
				945	Code generation is done using two passes. The first pass looks only for
				946	include files and type definitions. Type definitions are not checked during
				947	this phase, since they may depend upon include files. All included files
				948	are sequentially scanned in a first pass. Once the include tree has been
				949	resolved, a second pass is taken over all files which inserts type definitions
				950	into the parse tree and raises an error on any undefined types. The program is
				951	then generated against the parse tree.
				952
				953	Due to inherent complexities and potential for circular dependencies,
				954	we explicitly disallow forward declaration. Two Thrift structs cannot
				955	each contain an instance of the other. (Since we do not allow \texttt{null}
				956	struct instances in the generated C++ code, this would actually be impossible.)
				957
Aditya Agarwal	af524ee	2007-03-31 08:28:06 +0000	[diff] [blame]	958	\subsection{TFileTransport}
				959	The \texttt{TFileTransport} logs thrift requests/structs by
				960	framing incoming data with its length and writing it to disk.
				961	Using a framed on-disk format allows for better error checking and
				962	helps with processing a finite number of discrete events. The
				963	\texttt{TFileWriterTransport} uses a system of swapping in-memory buffers
				964	to ensure good performance while logging large amounts of data.
				965	A thrift logfile is split up into chunks of a speficified size and logged messages
				966	are not allowed to cross chunk boundaries. A message that would cross a chunk
				967	boundary will cause padding to be added until the end of the chunk and the
				968	first byte of the message is aligned to the beginning of the new chunk.
				969	Partitioning the file into chunks makes it possible to read and interpret data
				970	from a particular point in the file.
				971
Aditya Agarwal	adf3e7f	2007-03-31 16:56:14 +0000	[diff] [blame]	972	\section{Facebook thrift-based services}
				973	Thrift has been employed in a large number of applications at Facebook, including
				974	search, logging, mobile, ads and platform. Two specific usages are discussed below.
				975
				976	\subsection{Search}
				977	Thrift is used as the underlying protocol and transport for the Facebook seach service.
				978	The multi-language code generation is well suited for search because it allows application
				979	development in an efficient server side language (C++) and allows the Facebook PHP-based web application
				980	to make calls to the search service using Thrift PHP libraries. There is also a large
				981	variety of search stats, deployment and testing functionality that is built on top
				982	of the generated python code. In addition to this, the Thrift logfile format is
				983	used as a redolog for providing real-time search index updates. Thrift has allowed the
				984	search team to leverage each language for its strengths and to develop code at a rapid pace.
				985
				986	\subsection{Logging}
				987	The Thrift \texttt{TFileTransport} functionality is used for structured logging. Each
				988	service function definition along with its parameters can be considered to be
				989	a structured log entry identified by the function name. This log can then be used for
				990	a variety of purposes, including inline and offline processing, stats aggregation and as a redolog.
				991
Mark Slee	24b49d3	2007-03-21 01:24:00 +0000	[diff] [blame]	992	\section{Conclusions}
				993	Thrift has enabled Facebook to build scalable backend
				994	services efficiently by enabling engineers to divide and conquer. Application
				995	developers can focus upon application code without worrying about the
				996	sockets layer. We avoid duplicated work by writing buffering and I/O logic
				997	in one place, rather than interspersing it in each application.
				998
				999	Thrift has been employed in a wide variety of applications at Facebook,
				1000	including search, logging, mobile, ads, and platform. We have
				1001	found that the marginal performance cost incurred by an extra layer of
				1002	software abstraction is eclipsed by the gains in developer efficiency and
				1003	systems reliability.
				1004
				1005	\appendix
				1006
				1007	\section{Similar Systems}
				1008	The following are software systems similar to Thrift. Each is (very!) briefly
				1009	described:
				1010
				1011	\begin{itemize}
				1012	\item \textit{SOAP.} XML-based. Designed for web services via HTTP, excessive
				1013	XML parsing overhead.
				1014	\item \textit{CORBA.} Relatively comprehensive, debatably overdesigned and
				1015	heavyweight. Comparably cumbersome software installation.
				1016	\item \textit{COM.} Embraced mainly in Windows client softare. Not an entirely
				1017	open solution.
				1018	\item \textit{Pillar.} Lightweight and high-performance, but missing versioning
				1019	and abstraction.
				1020	\item \textit{Protocol Buffers.} Closed-source, owned by Google. Described in
				1021	Sawzall paper.
				1022	\end{itemize}
				1023
				1024	\acks
				1025
				1026	Many thanks for feedback on Thrift (and extreme trial by fire) are due to
Aditya Agarwal	af524ee	2007-03-31 08:28:06 +0000	[diff] [blame]	1027	Martin Smith, Karl Voskuil and Yishan Wong.
Mark Slee	24b49d3	2007-03-21 01:24:00 +0000	[diff] [blame]	1028
				1029	Thrift is a successor to Pillar, a similar system developed
				1030	by Adam D'Angelo, first while at Caltech and continued later at Facebook.
				1031	Thrift simply would not have happened without Adam's insights.
				1032
Marc Slemko	10b3bdb	2007-04-01 09:14:05 +0000	[diff] [blame]	1033	\begin{thebibliography}{}
Mark Slee	24b49d3	2007-03-21 01:24:00 +0000	[diff] [blame]	1034
Marc Slemko	10b3bdb	2007-04-01 09:14:05 +0000	[diff] [blame]	1035	\bibitem{boost.threads}
				1036	Kempf, William,
				1037	``Boost.Threads'',
				1038	\url{http://www.boost.org/doc/html/threads.html}
Mark Slee	24b49d3	2007-03-21 01:24:00 +0000	[diff] [blame]	1039
Marc Slemko	10b3bdb	2007-04-01 09:14:05 +0000	[diff] [blame]	1040	\bibitem{boost.threadpool}
				1041	Henkel, Philipp,
				1042	``threadpool'',
				1043	\url{http://threadpool.sourceforge.net}
				1044
				1045	\end{thebibliography}
Mark Slee	24b49d3	2007-03-21 01:24:00 +0000	[diff] [blame]	1046
				1047	\end{document}