DataFusion data sources: TableProvider and ListingTable
Re-exports§
pub use self::default_table_source::provider_as_source;
pub use self::default_table_source::source_as_provider;
pub use self::default_table_source::DefaultTableSource;
pub use self::memory::MemTable;
pub use self::view::ViewTable;
Modules§
- avro_to_arrow - This module contains code for reading Avro data into RecordBatches
- cte_worktable - CteWorkTable implementation used for recursive queries
- default_table_source - Default TableSource implementation used in DataFusion physical plans
- dynamic_file - dynamic_file_schema contains a UrlTableFactory implementation that can create a ListingTable from the given URL.
- empty - EmptyTable useful for testing.
- file_format - Module containing helper methods for the various file formats. See write.rs for write-related helper methods.
- listing - A table that uses the ObjectStore listing capability to get the list of files to process.
- listing_table_factory - Factory for creating ListingTables with default options
- memory - MemTable for querying Vec<RecordBatch> by DataFusion.
- object_store - ObjectStoreRegistry holds all the object stores at Runtime with a scheme for each store. This allows the user to extend DataFusion with different storage systems such as S3 or HDFS and query data inside these systems.
- physical_plan - Execution plans that read file formats
- provider - Data source traits
- schema_adapter - SchemaAdapter and SchemaAdapterFactory to adapt file-level record batches to a table schema.
- source - DataSource and DataSourceExec
- stream - TableProvider for stream sources, such as FIFO files
- view - View data source which uses a LogicalPlan as its input.
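
The object_store entry above describes a registry that associates a URL scheme with a storage backend. The following is a minimal standard-library sketch of that idea only. It does not use DataFusion's ObjectStoreRegistry or the real ObjectStore trait; the names Store, LocalStore, FakeS3Store, and SchemeRegistry are hypothetical, and keying on the scheme alone is a simplifying assumption.

```rust
use std::collections::HashMap;
use std::sync::Arc;

/// Hypothetical minimal storage interface (NOT the real ObjectStore trait).
trait Store: Send + Sync {
    fn name(&self) -> &str;
}

/// Hypothetical backend standing in for a local file system.
struct LocalStore;
impl Store for LocalStore {
    fn name(&self) -> &str {
        "local"
    }
}

/// Hypothetical backend standing in for an S3-like system.
struct FakeS3Store;
impl Store for FakeS3Store {
    fn name(&self) -> &str {
        "s3"
    }
}

/// Illustrative registry: one store per URL scheme (an assumption of this sketch).
#[derive(Default)]
struct SchemeRegistry {
    stores: HashMap<String, Arc<dyn Store>>,
}

impl SchemeRegistry {
    /// Register a store for a scheme, returning the store it replaced, if any.
    fn register(&mut self, scheme: &str, store: Arc<dyn Store>) -> Option<Arc<dyn Store>> {
        self.stores.insert(scheme.to_ascii_lowercase(), store)
    }

    /// Look up the store for a URL of the form `scheme://rest`.
    /// Returns None if the URL has no `://` or the scheme is not registered.
    fn get_for_url(&self, url: &str) -> Option<Arc<dyn Store>> {
        let (scheme, _rest) = url.split_once("://")?;
        self.stores.get(&scheme.to_ascii_lowercase()).cloned()
    }
}
```

Registering a new scheme is what lets queries reach an additional storage system; a lookup for a scheme that was never registered returns None in this sketch.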
Enums§
- TableType - Indicates the type of this table for metadata/catalog purposes.
Traits§
- TableProvider - A table which can be queried and modified.
Functions§
- get_statistics_with_limit - Get all files as well as the file-level summary statistics (no statistics for partition columns). If the optional limit is provided, includes only enough files to read up to limit rows. collect_stats is passed down from the configuration parameter on ListingTable. If it is false, we only construct bare statistics and skip a potentially expensive call to multiunzip for constructing file-level summary statistics.
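
The limit behavior described for get_statistics_with_limit can be illustrated with a small standard-library sketch. This is not DataFusion's implementation or signature: FileWithRows and take_files_up_to_limit are hypothetical names, and treating an unknown row count as "cannot stop early" is an assumption made here to stay conservative.

```rust
/// Hypothetical stand-in for a file and its row count, when known.
#[derive(Debug, Clone, PartialEq)]
struct FileWithRows {
    path: String,
    num_rows: Option<usize>,
}

/// Keep files, in order, until the known row total reaches `limit`.
///
/// Returns the selected files and their total row count, which is `None`
/// as soon as any selected file has an unknown row count.
fn take_files_up_to_limit(
    files: &[FileWithRows],
    limit: Option<usize>,
) -> (Vec<FileWithRows>, Option<usize>) {
    let mut selected = Vec::new();
    let mut total: Option<usize> = Some(0);

    for file in files {
        // Stop only when a limit is set AND the running total is known
        // and already sufficient. An unknown total never stops the scan.
        if let (Some(limit), Some(rows)) = (limit, total) {
            if rows >= limit {
                break;
            }
        }
        selected.push(file.clone());
        total = match (total, file.num_rows) {
            (Some(t), Some(n)) => Some(t + n),
            _ => None,
        };
    }

    (selected, total)
}
```

With files of 10, 20, and 30 rows and a limit of 25, the sketch keeps the first two files, since 30 rows already cover the limit. Without a limit, every file is kept.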