Input/output#
Avro#
|
Read into a DataFrame from Apache Avro format. |
|
Write to Apache Avro file. |
Clipboard#
|
Read text from clipboard and pass to |
|
Copy |
CSV#
|
Read a CSV file into a DataFrame. |
|
Read a CSV file in batches. |
|
Lazily read from a CSV file or multiple files via glob patterns. |
|
Write to comma-separated values (CSV) file. |
|
Evaluate the query in streaming mode and write to a CSV file. |
Read |
Database#
|
Read the results of a SQL query into a DataFrame, given a connection object. |
|
Read the results of a SQL query into a DataFrame, given a URI. |
|
Write the data in a Polars DataFrame to a database. |
Delta Lake#
|
Reads into a DataFrame from a Delta lake table. |
|
Lazily read from a Delta lake table. |
|
Write DataFrame as delta table. |
Excel / ODS#
|
Read Excel spreadsheet data into a DataFrame. |
|
Read OpenOffice (ODS) spreadsheet data into a DataFrame. |
|
Write frame data to a table in an Excel workbook/worksheet. |
Feather / IPC#
|
Read into a DataFrame from Arrow IPC (Feather v2) file. |
|
Get the schema of an IPC file without reading data. |
|
Read into a DataFrame from Arrow IPC record batch stream. |
|
Lazily read from an Arrow IPC (Feather v2) file or multiple files via glob patterns. |
|
Write to Arrow IPC binary stream or Feather file. |
|
Write to Arrow IPC record batch stream. |
|
Evaluate the query in streaming mode and write to an IPC file. |
Iceberg#
|
Lazily read from an Apache Iceberg table. |
JSON#
|
Read into a DataFrame from a JSON file. |
|
Read into a DataFrame from a newline delimited JSON file. |
|
Lazily read from a newline delimited JSON file or multiple files via glob patterns. |
|
Serialize to JSON representation. |
|
Serialize to newline delimited JSON representation. |
|
Evaluate the query in streaming mode and write to an NDJSON file. |
Parquet#
|
Read into a DataFrame from a parquet file. |
|
Get the schema of a Parquet file without reading data. |
|
Lazily read from a local or cloud-hosted parquet file (or files). |
|
Write to Apache Parquet file. |
|
Evaluate the query in streaming mode and write to a Parquet file. |
PyArrow Datasets#
Connect to pyarrow datasets.
|
Scan a pyarrow dataset. |
Cloud Credentials#
Configuration for cloud credential provisioning.
Base class for credential providers. |
|
|
AWS Credential Provider. |
|
Azure Credential Provider. |
|
GCP Credential Provider. |