Add a batch scanner that can be used directly for the whole table

### Search before asking

- [x] I searched in the [issues](https://github.com/apache/fluss/issues) and found nothing similar.


### Motivation
#### Iceberg & Arrow 's Behavior 
Apache Arrow:

```python
import pyarrow as pa

# Iterate over data in batches
for batch in table.to_batches():
    print(batch.to_pandas())
```

Apache Iceberg:
```java
org.apache.iceberg.Scanner<Record> scanner = icebergTable.newScan().limit(100).build();
scanner.forEach(record -> ...);
```

Both libraries provide an efficient way to iterate over large datasets using batched access.

#### ❌ Limitation in Fluss
In Fluss, if you want to perform a table-level scan with LIMIT or projection, you have to manually iterate through each bucket and create individual scanners for each one, even if the table is not partitioned.

Example current workaround:

```java
try (Connection connection = ConnectionFactory.createConnection(flussConfig)) {
    Table table = connection.getTable(tablePath);
    Admin flussAdmin = connection.getAdmin();

    // Get table info and generate list of buckets
    TableInfo tableInfo = flussAdmin.getTableInfo(tablePath).get();
    int bucketCount = tableInfo.getNumBuckets();
    List<TableBucket> tableBuckets;
     if (tableInfo.isPartitioned()) {
                List<PartitionInfo> partitionInfos = flussAdmin.listPartitionInfos(tablePath).get();
                tableBuckets =
                        partitionInfos.stream()
                                .flatMap(
                                        partitionInfo ->
                                                IntStream.range(0, bucketCount)
                                                        .mapToObj(
                                                                bucketId ->
                                                                        new TableBucket(
                                                                                tableInfo
                                                                                        .getTableId(),
                                                                                partitionInfo
                                                                                        .getPartitionId(),
                                                                                bucketId)))
                                .collect(Collectors.toList());
            } else {
                tableBuckets =
                        IntStream.range(0, bucketCount)
                                .mapToObj(
                                        bucketId ->
                                                new TableBucket(tableInfo.getTableId(), bucketId))
                                .collect(Collectors.toList());
            }

    Scan scan = table.newScan().limit(limit).project(projectedFields);
    List<BatchScanner> scanners = 
        tableBuckets.stream()
                   .map(scan::createBatchScanner)
                   .collect(Collectors.toList());

    List<InternalRow> scannedRows = BatchScanUtils.collectLimitedRows(scanners, limit);
}
```
This approach is not intuitive for users

###  Proposed Solution
We recommend introducing a createBatchScanner() method at the table level, similar to how Iceberg and Arrow do it.

✅ Expected Usage
```java
Table table = connection.getTable(tablePath);

// Create a batch scanner that can be used directly for the whole table
BatchScanner batchScanner =
    table.newScan()
         .project(projectedFields)
         .limit(limit)
         .createBatchScanner(); // <-- New API

// Iterate over batches
while (batchScanner.hasNext()) {
    InternalRow row = batchScanner.next();
    // process row
}

// or get all the rows at once
List<InternalRow> actualRows = BatchScanUtils.collectRows(batchScanner);
```

### Anything else?

_No response_

### Willingness to contribute

- [x] I'm willing to submit a PR!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add a batch scanner that can be used directly for the whole table #2793

Search before asking

Motivation

Iceberg & Arrow 's Behavior

❌ Limitation in Fluss

Proposed Solution

Anything else?

Willingness to contribute

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Uh oh!

Add a batch scanner that can be used directly for the whole table #2793

Description

Search before asking

Motivation

Iceberg & Arrow 's Behavior

❌ Limitation in Fluss

Proposed Solution

Anything else?

Willingness to contribute

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions