Input/output#
CSV#
|
Read a CSV file into a DataFrame. |
|
Read a CSV file in batches. |
|
Lazily read from a CSV file or multiple files via glob patterns. |
Write to comma-separated values (CSV) file. |
|
|
Evaluate the query in streaming mode and write to a CSV file. |
Feather/ IPC#
|
Read into a DataFrame from Arrow IPC (Feather v2) file. |
|
Read into a DataFrame from Arrow IPC record batch stream. |
|
Lazily read from an Arrow IPC (Feather v2) file or multiple files via glob patterns. |
|
Get the schema of an IPC file without reading data. |
Write to Arrow IPC binary stream or Feather file. |
|
Write to Arrow IPC record batch stream. |
|
|
Evaluate the query in streaming mode and write to an IPC file. |
Parquet#
|
Read into a DataFrame from a parquet file. |
|
Lazily read from a local or cloud-hosted parquet file (or files). |
|
Get the schema of a Parquet file without reading data. |
|
Write to Apache Parquet file. |
|
Evaluate the query in streaming mode and write to a Parquet file. |
Database#
Read the results of a SQL query into a DataFrame, given a connection object. |
|
|
Read the results of a SQL query into a DataFrame, given a URI. |
|
Write a polars frame to a database. |
JSON#
|
Read into a DataFrame from a JSON file. |
|
Read into a DataFrame from a newline delimited JSON file. |
|
Lazily read from a newline delimited JSON file or multiple files via glob patterns. |
Serialize to JSON representation. |
|
Serialize to newline delimited JSON representation. |
AVRO#
|
Read into a DataFrame from Apache Avro format. |
|
Write to Apache Avro file. |
Spreadsheet#
Read Excel (XLSX) spreadsheet data into a DataFrame. |
|
|
Read OpenOffice (ODS) spreadsheet data into a DataFrame. |
|
Write frame data to a table in an Excel workbook/worksheet. |
Apache Iceberg#
|
Lazily read from an Apache Iceberg table. |
Delta Lake#
|
Lazily read from a Delta lake table. |
|
Reads into a DataFrame from a Delta lake table. |
|
Write DataFrame as delta table. |
Datasets#
Connect to pyarrow datasets.
|
Scan a pyarrow dataset. |
BatchedCsvReader#
This reader comes available by calling pl.read_csv_batched
.
Read |