Skip to content

Image showing the Polars Cloud logo

Introducing Polars Cloud

Polars Cloud is in alpha stage

Polars Cloud is currently available to a select group of individuals and companies for early-stage testing. You can learn more about Polars Cloud and its goals in our recent announcement post.

DataFrame implementations always differed from SQL and databases. SQL could run anywhere from embedded databases to massive data warehouses. Yet, DataFrame users have been forced to choose between a solution for local work or solutions geared towards distributed computing, each with their own APIs and limitations.

Polars is bridging this gap with Polars Cloud. Build on top of the popular open source project, Polars Cloud enables you to write DataFrame code once and run it anywhere. The distributed engine available with Polars Cloud allows to scale your Polars queries beyond a single machine.

Key Features of Polars Cloud

  • Unified DataFrame Experience: Run a Polars query seamlessly on your local machine and at scale with our new distributed engine. All from the same API.
  • Serverless Compute: Effortlessly start compute resources without managing infrastructure with options to execute queries on both CPU and GPU.
  • Any Environment: Start a remote query from a notebook on your machine, Airflow DAG, AWS Lambda, or your server. Get the flexibility to embed Polars Cloud in any environment.

Install Polars Cloud

Simply extend the capabilities of Polars with:

pip install polars polars_cloud

Example query

To run your query in the cloud, simply write Polars queries like you are used to, but call LazyFrame.remote() to indicate that the query should be run remotely.

ComputeContext ยท LazyFrameExt

import polars as pl
import polars_cloud as pc

ctx = pc.ComputeContext(cpus=16, memory=64)

query = (
    pl.scan_parquet("s3://my-dataset/")
    .group_by("l_returnflag", "l_linestatus")
    .agg(
        avg_price=pl.mean("l_extendedprice"),
        avg_disc=pl.mean("l_discount"),
        count_order=pl.len(),
    )
)

(
    query.remote(ctx)
    .sink_parquet("s3://my-dst/")
)

Sign up today and start for free

Polars Cloud is still in an early development stage and available at no additional cost. You only pay for the resources you use in your own cloud environment.

Cloud availability

Currently, Polars Cloud is available on AWS. Other cloud providers and on-premise solution is on the roadmap and will become available in the upcoming months.

AWS logo