# Data IO (Python functions)

A TFRecords file represents a sequence of (binary) strings. The format is not random access, so it is suitable for streaming large amounts of data but not suitable if fast sharding or other non-sequential access is desired.

## TFRecords Format Details

A TFRecords file contains a sequence of strings with CRC hashes. Each record has the format

uint64 length

masked_crc = ((crc >> 15) | (crc << 17)) + 0xa282ead8ul