Struct polars_io::parquet::read::ParquetReader
source · pub struct ParquetReader<R: Read + Seek> { /* private fields */ }
Available on crate feature
parquet
only.Expand description
Read Apache parquet format into a DataFrame.
Implementations§
source§impl<R: MmapBytesReader> ParquetReader<R>
impl<R: MmapBytesReader> ParquetReader<R>
sourcepub fn set_low_memory(self, low_memory: bool) -> Self
pub fn set_low_memory(self, low_memory: bool) -> Self
Try to reduce memory pressure at the expense of performance. If setting this does not reduce memory enough, turn off parallelization.
sourcepub fn read_parallel(self, parallel: ParallelStrategy) -> Self
pub fn read_parallel(self, parallel: ParallelStrategy) -> Self
Read the parquet file in parallel (default). The single threaded reader consumes less memory.
sourcepub fn with_n_rows(self, num_rows: Option<usize>) -> Self
pub fn with_n_rows(self, num_rows: Option<usize>) -> Self
Stop reading at num_rows
rows.
sourcepub fn with_columns(self, columns: Option<Vec<String>>) -> Self
pub fn with_columns(self, columns: Option<Vec<String>>) -> Self
Columns to select/ project
sourcepub fn with_projection(self, projection: Option<Vec<usize>>) -> Self
pub fn with_projection(self, projection: Option<Vec<usize>>) -> Self
Set the reader’s column projection. This counts from 0, meaning that
vec![0, 4]
would select the 1st and 5th column.
sourcepub fn with_row_index(self, row_index: Option<RowIndex>) -> Self
pub fn with_row_index(self, row_index: Option<RowIndex>) -> Self
Add a row index column.
sourcepub fn with_schema(self, schema: Option<ArrowSchemaRef>) -> Self
pub fn with_schema(self, schema: Option<ArrowSchemaRef>) -> Self
Set the Schema
if already known. This must be exactly the same as
the schema in the file itself.
sourcepub fn use_statistics(self, toggle: bool) -> Self
pub fn use_statistics(self, toggle: bool) -> Self
Use statistics in the parquet to determine if pages can be skipped from reading.
pub fn with_hive_partition_columns(self, columns: Option<Vec<Series>>) -> Self
pub fn get_metadata(&mut self) -> PolarsResult<&FileMetaDataRef>
pub fn with_predicate(self, predicate: Option<Arc<dyn PhysicalIoExpr>>) -> Self
source§impl<R: MmapBytesReader + 'static> ParquetReader<R>
impl<R: MmapBytesReader + 'static> ParquetReader<R>
pub fn batched(self, chunk_size: usize) -> PolarsResult<BatchedParquetReader>
Trait Implementations§
source§impl<R: MmapBytesReader> SerReader<R> for ParquetReader<R>
impl<R: MmapBytesReader> SerReader<R> for ParquetReader<R>
source§fn new(reader: R) -> Self
fn new(reader: R) -> Self
Create a new ParquetReader
from an existing Reader
.
source§fn set_rechunk(self, rechunk: bool) -> Self
fn set_rechunk(self, rechunk: bool) -> Self
Make sure that all columns are contiguous in memory by
aggregating the chunks into a single array.
Auto Trait Implementations§
impl<R> Freeze for ParquetReader<R>where
R: Freeze,
impl<R> !RefUnwindSafe for ParquetReader<R>
impl<R> Send for ParquetReader<R>where
R: Send,
impl<R> Sync for ParquetReader<R>where
R: Sync,
impl<R> Unpin for ParquetReader<R>where
R: Unpin,
impl<R> !UnwindSafe for ParquetReader<R>
Blanket Implementations§
source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more
§impl<T> Instrument for T
impl<T> Instrument for T
§fn instrument(self, span: Span) -> Instrumented<Self>
fn instrument(self, span: Span) -> Instrumented<Self>
§fn in_current_span(self) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
source§impl<T> IntoEither for T
impl<T> IntoEither for T
source§fn into_either(self, into_left: bool) -> Either<Self, Self>
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts
self
into a Left
variant of Either<Self, Self>
if into_left
is true
.
Converts self
into a Right
variant of Either<Self, Self>
otherwise. Read moresource§fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts
self
into a Left
variant of Either<Self, Self>
if into_left(&self)
returns true
.
Converts self
into a Right
variant of Either<Self, Self>
otherwise. Read more