OurSQL Episode 40: TokuDB 5.0 part 2

This week we conclude the interview with Tokutek's co-founder and Chief Scientist, Martin Farach-Colton. Part 1 of the interview. TokuDB is a transactional, fully ACID-compliant storage engine that uses fractal trees for data and indexes, instead of MySQL's standard B-tree implementation. Combined with TokuDB's message-based architecture, TokuDB data and indexes do not fragment, have completely online column and index addition and removal and unlike InnoDB and XtraDB, do not fall apart when indexes no longer fit into memory.

Topics covered:
How TokuDB achieves consistency and durability, and what happens if MySQL or a server crashes.
Backups and TokuDB
Replication and TokuDB
How TokuDB does compression, and the impressive rates (10-12x usually)
Using TokuDB instead of partitioning, especially given the lack of fragmentation in TokuDB.
TokuDB's bulk loader - building a new table with lots of data and indexes with a fully parallel bulk loader
TokuDB and multi-threadedness
Cache-oblivious algorithmics
TokuDB server tuning and diagnostics with SHOW ENGINE TOKUDB STATUS (which is in table format, much better than the InnoDB equivalent)
Graphic novels - Logicomix was inspired by the epic story of the quest for the Foundations of Mathematics.
How to get TokuDB, licensing and how to compare TokuDB to InnoDB

Tokutek sessions at Collaborate:
Understanding Indexing without needing to know about data structures, by Zardosht Kasheff - Monday, April 11 from 10:30 am - 11:30 am
How Fractal Trees Speed Up Trickle Loading While Maintaining Indexes by Bradley Kuszmaul - Monday, April 11 from 2:30 pm - 3:30 pm
Exploiting Fast Indexing in MySQL by Bradley Kuszmaul - Monday, April 11 from 3:45 pm - 4:45 pm

Tokutek session at the O'Reilly MySQL Conference:
Understanding Indexing without needing to know about data structures, by Zardosht Kasheff - Tuesday, April 12 from 11:55 am - 12:40 pm

Ear Candy
In this week's ear candy we talk about atomic table rename.

Where you can see us
On Thursday, April 7, 2011 at 6:30 PM Giuseppe Maxia will speak at the San Francisco MySQL User Group about "Advanced Replication for the Masses". The topic will cover how to do advanced replication tasks not possible with the standard MySQL build, such as allowing a slave to have more than one master and having a multi-threaded process to apply SQL statements to the slave.

Sheeri will be at Collaborate in Orlando, Florida from Sunday April 10th through Thursday April 14th. She is organizing the Community dinner, eastern US edition on Sunday April 10th at 5 pm at Maggiano's Little Italy near the Convention Center.

Sarah will be at the O'Reilly MySQL Conference in Santa Clara, California from Monday, April 11th through Thursday April 14th, including the community dinner, western US edition on Monday April 11th at 7 pm at Pedro's.

Sheeri will be speaking about monitoring MySQL efficiently with Nagios at the Professional IT Community Conference, otherwise known as PICC, in New Brunswick, New Jersey. The conference is Friday, April 29 – Saturday April 30, 2011.

Sheeri will be at OpenDBCamp, Fri May 6th - Sun May 8th in Sardinia, Italy.

Sarah will be at the Velocity Conference, Tuesday June 14 - 16 in Santa Clara, California, speaking about Where is your data cached (and where should it be cached)?.

Feedback
Facebook group
e-mail: podcast at technocation.org
voicemail using phone/Skype: +1-617-674-2369
twitter: @oursqlcast
or Tweet about @oursqlcast