Struct RecordBatchPartitionSplitter

Source
pub struct RecordBatchPartitionSplitter { /* private fields */ }
Expand description

The splitter used to split the record batch into multiple record batches by the partition spec.

  1. It will project and transform the input record batch based on the partition spec, get the partitioned record batch.
  2. Split the input record batch into multiple record batches based on the partitioned record batch.

§Partition Value Modes

The splitter supports two modes for obtaining partition values:

  • Computed mode (calculator is Some): Computes partition values from source columns using transforms
  • Pre-computed mode (calculator is None): Expects a _partition column in the input batch

Implementations§

Source§

impl RecordBatchPartitionSplitter

Source

pub fn try_new( iceberg_schema: SchemaRef, partition_spec: PartitionSpecRef, calculator: Option<PartitionValueCalculator>, ) -> Result<Self>

Create a new RecordBatchPartitionSplitter.

§Arguments
  • iceberg_schema - The Iceberg schema reference
  • partition_spec - The partition specification reference
  • calculator - Optional calculator for computing partition values from source columns.
    • Some(calculator): Compute partition values from source columns using transforms
    • None: Expect a pre-computed _partition column in the input batch
§Returns

Returns a new RecordBatchPartitionSplitter instance or an error if initialization fails.

Source

pub fn try_new_with_computed_values( iceberg_schema: SchemaRef, partition_spec: PartitionSpecRef, ) -> Result<Self>

Create a new RecordBatchPartitionSplitter with computed partition values.

This is a convenience method that creates a calculator and initializes the splitter to compute partition values from source columns.

§Arguments
  • iceberg_schema - The Iceberg schema reference
  • partition_spec - The partition specification reference
§Returns

Returns a new RecordBatchPartitionSplitter instance or an error if initialization fails.

Source

pub fn try_new_with_precomputed_values( iceberg_schema: SchemaRef, partition_spec: PartitionSpecRef, ) -> Result<Self>

Create a new RecordBatchPartitionSplitter expecting pre-computed partition values.

This is a convenience method that initializes the splitter to expect a _partition column in the input batches.

§Arguments
  • iceberg_schema - The Iceberg schema reference
  • partition_spec - The partition specification reference
§Returns

Returns a new RecordBatchPartitionSplitter instance or an error if initialization fails.

Source

pub fn split( &self, batch: &RecordBatch, ) -> Result<Vec<(PartitionKey, RecordBatch)>>

Split the record batch into multiple record batches based on the partition spec.

Auto Trait Implementations§

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
§

impl<T> AsAny for T
where T: Any,

§

fn as_any(&self) -> &(dyn Any + 'static)

§

fn as_any_mut(&mut self) -> &mut (dyn Any + 'static)

§

fn type_name(&self) -> &'static str

Gets the type name of self
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
§

impl<T> Downcast for T
where T: AsAny + ?Sized,

§

fn is<T>(&self) -> bool
where T: AsAny,

Returns true if the boxed type is the same as T. Read more
§

fn downcast_ref<T>(&self) -> Option<&T>
where T: AsAny,

Forward to the method defined on the type Any.
§

fn downcast_mut<T>(&mut self) -> Option<&mut T>
where T: AsAny,

Forward to the method defined on the type Any.
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

§

impl<T> Instrument for T

§

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided [Span], returning an Instrumented wrapper. Read more
§

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper. Read more
Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T> IntoEither for T

Source§

fn into_either(self, into_left: bool) -> Either<Self, Self>

Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more
Source§

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more
§

impl<T> Pointable for T

§

const ALIGN: usize

The alignment of pointer.
§

type Init = T

The type for initializers.
§

unsafe fn init(init: <T as Pointable>::Init) -> usize

Initializes a with the given initializer. Read more
§

unsafe fn deref<'a>(ptr: usize) -> &'a T

Dereferences the given pointer. Read more
§

unsafe fn deref_mut<'a>(ptr: usize) -> &'a mut T

Mutably dereferences the given pointer. Read more
§

unsafe fn drop(ptr: usize)

Drops the object pointed to by the given pointer. Read more
§

impl<T> PolicyExt for T
where T: ?Sized,

§

fn and<P, B, E>(self, other: P) -> And<T, P>
where T: Policy<B, E>, P: Policy<B, E>,

Create a new Policy that returns [Action::Follow] only if self and other return Action::Follow. Read more
§

fn or<P, B, E>(self, other: P) -> Or<T, P>
where T: Policy<B, E>, P: Policy<B, E>,

Create a new Policy that returns [Action::Follow] if either self or other returns Action::Follow. Read more
Source§

impl<T> Same for T

Source§

type Output = T

Should always be Self
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.
§

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

§

fn vzip(self) -> V

§

impl<T> WithSubscriber for T

§

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

Attaches the provided Subscriber to this type, returning a [WithDispatch] wrapper. Read more
§

fn with_current_subscriber(self) -> WithDispatch<Self>

Attaches the current default Subscriber to this type, returning a [WithDispatch] wrapper. Read more
§

impl<T> MaybeSend for T
where T: Send,