lance-format
diff --git a/‎docs/src/.pages‎
Lines changed: 1 addition & 0 deletions b/‎docs/src/.pages‎
Lines changed: 1 addition & 0 deletions
diff --git a/‎docs/src/config.md‎
Lines changed: 11 additions & 0 deletions b/‎docs/src/config.md‎
Lines changed: 11 additions & 0 deletions
diff --git a/‎docs/src/performance.md‎
Lines changed: 134 additions & 0 deletions b/‎docs/src/performance.md‎
Lines changed: 134 additions & 0 deletions
diff --git a/‎plugin/trino-lance/pom.xml‎
Lines changed: 9 additions & 1 deletion b/‎plugin/trino-lance/pom.xml‎
Lines changed: 9 additions & 1 deletion
diff --git a/‎plugin/trino-lance/src/main/java/io/trino/plugin/lance/LanceArrowToPageScanner.java‎
Lines changed: 8 additions & 4 deletions b/‎plugin/trino-lance/src/main/java/io/trino/plugin/lance/LanceArrowToPageScanner.java‎
Lines changed: 8 additions & 4 deletions
diff --git a/‎plugin/trino-lance/src/main/java/io/trino/plugin/lance/LanceBasePageSource.java‎
Lines changed: 6 additions & 13 deletions b/‎plugin/trino-lance/src/main/java/io/trino/plugin/lance/LanceBasePageSource.java‎
Lines changed: 6 additions & 13 deletions
diff --git a/‎plugin/trino-lance/src/main/java/io/trino/plugin/lance/LanceConfig.java‎
Lines changed: 89 additions & 2 deletions b/‎plugin/trino-lance/src/main/java/io/trino/plugin/lance/LanceConfig.java‎
Lines changed: 89 additions & 2 deletions
@@ -2,4 +2,5 @@ nav:
   - Welcome: index.md
   - Install: install.md
   - Config: config.md
+  - Performance: performance.md
   - Operations: operations
@@ -68,6 +68,17 @@ CREATE TABLE lance.myschema.users (id BIGINT, name VARCHAR);
 | `lance.single_level_ns` | Enable single-level mode with virtual `default` schema | `false` |
 | `lance.parent` | Parent namespace prefix (levels separated by `$`) | - |
 
+## Read and Write Settings
+
+Control batch sizes for vectorized operations:
+
+| Property | Description | Default |
+|----------|-------------|---------|
+| `lance.read_batch_size` | Rows per batch during vectorized reads | `8192` |
+| `lance.write_batch_size` | Rows to batch before writing to Arrow | `10000` |
+| `lance.max_rows_per_file` | Maximum rows per Lance file | `1000000` |
+| `lance.max_rows_per_group` | Maximum rows per row group | `100000` |
+
 ## Examples
 
 ### Directory Namespace
 
@@ -0,0 +1,134 @@
+# Performance Tuning
+
+This guide covers performance tuning for Lance Trino operations in large-scale analytics scenarios.
+
+## Understanding Lance's Default Optimization
+
+Lance is **optimized by default for random access patterns** - fast point lookups, vector searches, and selective column reads.
+These defaults work well for ML/AI workloads where you frequently access individual records or small batches.
+
+For **large-scale batch ETL and scan-heavy OLAP operations** (writing millions of rows, full table scans, bulk exports),
+you can tune Lance's environment variables and connector properties to better utilize available resources.
+
+## Caching
+
+Lance Trino uses a multi-level caching strategy to minimize redundant I/O and improve query performance.
+
+### How Caching Works
+
+Lance Trino implements two levels of caching:
+
+1. **Session Cache** - Contains index and metadata caches:
+
+    - **Index Cache**: Caches opened vector indices, fragment reuse indices, and index metadata
+    - **Metadata Cache**: Caches manifests, transactions, deletion files, row ID indices, and file metadata
+
+2. **Dataset Cache** - Caches opened datasets by `(userIdentity, tablePath, version)` key. Since a dataset at a specific version is immutable, this ensures:
+
+    - Each dataset is opened only once per worker
+    - All workers read the same version for snapshot isolation
+    - Schema and fragment metadata are reused from the cached dataset
+
+### Cache Configuration
+
+Configure caching behavior via connector properties in your catalog file:
+
+```properties
+# Session cache settings
+lance.cache.session.max_entries=100                       # Maximum cached sessions (default: 100)
+lance.cache.session.ttl_minutes=60                        # Session cache TTL in minutes (default: 60)
+lance.cache.session.index_cache_size_bytes=6442450944     # Index cache size: 6GB
+lance.cache.session.metadata_cache_size_bytes=1073741824  # Metadata cache size: 1GB
+
+# Dataset cache settings
+lance.cache.dataset.max_entries=100          # Maximum cached datasets (default: 100)
+lance.cache.dataset.ttl_minutes=30           # Dataset cache TTL in minutes (default: 30)
+```
+
+| Property | Description | Default |
+|----------|-------------|---------|
+| `lance.cache.session.max_entries` | Maximum number of cached sessions | `100` |
+| `lance.cache.session.ttl_minutes` | Session cache TTL in minutes | `60` |
+| `lance.cache.session.index_cache_size_bytes` | Index cache size in bytes | Lance default (6GB) |
+| `lance.cache.session.metadata_cache_size_bytes` | Metadata cache size in bytes | Lance default (1GB) |
+| `lance.cache.dataset.max_entries` | Maximum number of cached datasets | `100` |
+| `lance.cache.dataset.ttl_minutes` | Dataset cache TTL in minutes | `30` |
+
+The index cache stores vector indices which can be large but provide significant speedup for vector search queries.
+Increase this if you frequently query tables with vector indices.
+
+The metadata cache stores manifests, file metadata, and other dataset metadata.
+Each column's metadata can be around 40MB, so increase this if your tables have many columns.
+
+## Lance Environment Variables
+
+Lance uses environment variables for low-level I/O tuning. Set these on your Trino coordinator and worker nodes.
+
+### Read Performance
+
+#### I/O Threads
+
+Set via environment variable `LANCE_IO_THREADS` (default: 64).
+
+Controls the number of I/O threads used for parallel reads from storage.
+For large scans, increasing this to match your CPU core count enables more concurrent S3 requests.
+
+```bash
+export LANCE_IO_THREADS=128
+```
+
+### Write Performance
+
+#### Upload Concurrency
+
+Set via environment variable `LANCE_UPLOAD_CONCURRENCY` (default: 10).
+
+Controls the number of concurrent multipart upload streams to S3.
+Increasing this to match your CPU core count can improve throughput.
+
+```bash
+export LANCE_UPLOAD_CONCURRENCY=32
+```
+
+#### Upload Part Size
+
+Set via environment variable `LANCE_INITIAL_UPLOAD_SIZE` (default: 5MB).
+
+Controls the initial part size for S3 multipart uploads.
+Larger part sizes reduce the number of API calls and can improve throughput for large writes.
+However, larger part sizes use more memory and may increase latency for small writes.
+Use the default for interactive workloads.
+
+!!!note
+    Lance automatically increments the multipart upload size by 5MB every 100 uploads,
+    so large file writes progressively use increasingly large upload parts.
+    There is no configuration for a fixed upload size.
+
+```bash
+export LANCE_INITIAL_UPLOAD_SIZE=33554432  # 32MB
+```
+
+### Environment Variables Summary
+
+| Variable | Description | Default |
+|----------|-------------|---------|
+| `LANCE_IO_THREADS` | Number of I/O threads for parallel reads | `64` |
+| `LANCE_UPLOAD_CONCURRENCY` | Number of concurrent S3 upload streams | `10` |
+| `LANCE_INITIAL_UPLOAD_SIZE` | Initial S3 multipart upload part size (bytes) | `5242880` (5MB) |
+
+## Index-Aware Split Planning
+
+Lance Trino optimizes split planning based on index availability. When a table has indexes on filtered columns, larger splits are used because index lookups are efficient.
+
+```properties
+# Rows per split when btree index is used (default: 100M)
+lance.index.btree.rows_per_split=100000000
+
+# Rows per split when bitmap index is used (default: 10M)
+lance.index.bitmap.rows_per_split=10000000
+```
+
+| Property | Description | Default |
+|----------|-------------|---------|
+| `lance.index.btree.rows_per_split` | Row count threshold for btree-indexed splits | `100000000` (100M) |
+| `lance.index.bitmap.rows_per_split` | Row count threshold for bitmap-indexed splits | `10000000` (10M) |
@@ -108,7 +108,7 @@
         <dependency>
             <groupId>org.lance</groupId>
             <artifactId>lance-core</artifactId>
-            <version>2.0.0</version>
+            <version>3.0.0-beta.4</version>
             <exclusions>
                 <exclusion>
                     <groupId>org.junit.jupiter</groupId>
@@ -418,7 +418,15 @@
                     <ignoredUnusedDeclaredDependencies combine.children="append">
                         <!-- Required at compile time for substrait Expression class hierarchy -->
                         <ignoredUnusedDeclaredDependency>com.google.protobuf:protobuf-java</ignoredUnusedDeclaredDependency>
+                        <!-- Currently unused but kept for potential future use -->
+                        <ignoredUnusedDeclaredDependency>io.trino:trino-cache</ignoredUnusedDeclaredDependency>
+                        <!-- Used for @SuppressModernizer annotation -->
+                        <ignoredUnusedDeclaredDependency>org.gaul:modernizer-maven-annotations</ignoredUnusedDeclaredDependency>
                     </ignoredUnusedDeclaredDependencies>
+                    <ignoredUsedUndeclaredDependencies>
+                        <!-- Provided transitively by the parent pom -->
+                        <ignoredUsedUndeclaredDependency>org.gaul:modernizer-maven-annotations</ignoredUsedUndeclaredDependency>
+                    </ignoredUsedUndeclaredDependencies>
                 </configuration>
             </plugin>
             <plugin>
 
@@ -94,9 +94,11 @@ public LanceArrowToPageScanner(
             ScannerFactory scannerFactory,
             Map<String, String> storageOptions,
             Optional<ByteBuffer> substraitFilter,
-            OptionalLong limit)
+            OptionalLong limit,
+            String userIdentity,
+            Long datasetVersion)
     {
-        this(allocator, path, columns, List.of(), scannerFactory, storageOptions, substraitFilter, limit);
+        this(allocator, path, columns, List.of(), scannerFactory, storageOptions, substraitFilter, limit, userIdentity, datasetVersion);
     }
 
     public LanceArrowToPageScanner(
@@ -107,7 +109,9 @@ public LanceArrowToPageScanner(
             ScannerFactory scannerFactory,
             Map<String, String> storageOptions,
             Optional<ByteBuffer> substraitFilter,
-            OptionalLong limit)
+            OptionalLong limit,
+            String userIdentity,
+            Long datasetVersion)
     {
         this.allocator = requireNonNull(allocator, "allocator is null");
         requireNonNull(columns, "columns is null");
@@ -148,7 +152,7 @@ public LanceArrowToPageScanner(
             }
         }
 
-        lanceScanner = scannerFactory.open(path, allocator, projectionColumns, storageOptions, substraitFilter, limit);
+        lanceScanner = scannerFactory.open(path, allocator, projectionColumns, storageOptions, substraitFilter, limit, userIdentity, datasetVersion);
         this.arrowReader = lanceScanner.scanBatches();
         try {
             this.vectorSchemaRoot = arrowReader.getVectorSchemaRoot();
 
@@ -19,13 +19,11 @@
 import io.trino.spi.connector.ConnectorPageSource;
 import org.apache.arrow.memory.BufferAllocator;
 import org.apache.arrow.memory.RootAllocator;
-import org.apache.arrow.util.VisibleForTesting;
 
 import java.util.List;
 import java.util.Map;
 import java.util.concurrent.atomic.AtomicBoolean;
 import java.util.concurrent.atomic.AtomicLong;
-import java.util.stream.Collectors;
 
 import static com.google.common.base.Preconditions.checkState;
 import static com.google.common.collect.ImmutableList.toImmutableList;
@@ -46,12 +44,12 @@ public abstract class LanceBasePageSource
     protected final BufferAllocator bufferAllocator;
     protected final PageBuilder pageBuilder;
 
-    public LanceBasePageSource(LanceTableHandle tableHandle, List<LanceColumnHandle> columns, ScannerFactory scannerFactory, Map<String, String> storageOptions)
+    public LanceBasePageSource(LanceTableHandle tableHandle, List<LanceColumnHandle> columns, ScannerFactory scannerFactory, Map<String, String> storageOptions, String userIdentity)
     {
-        this(tableHandle, columns, List.of(), scannerFactory, storageOptions);
+        this(tableHandle, columns, List.of(), scannerFactory, storageOptions, userIdentity);
     }
 
-    public LanceBasePageSource(LanceTableHandle tableHandle, List<LanceColumnHandle> columns, List<String> filterProjectionColumns, ScannerFactory scannerFactory, Map<String, String> storageOptions)
+    public LanceBasePageSource(LanceTableHandle tableHandle, List<LanceColumnHandle> columns, List<String> filterProjectionColumns, ScannerFactory scannerFactory, Map<String, String> storageOptions, String userIdentity)
     {
         this.tableHandle = tableHandle;
         this.bufferAllocator = allocator.newChildAllocator(tableHandle.getTableName(), 1024, Long.MAX_VALUE);
@@ -66,7 +64,9 @@ public LanceBasePageSource(LanceTableHandle tableHandle, List<LanceColumnHandle>
                             scannerFactory,
                             storageOptions,
                             tableHandle.getSubstraitFilterBuffer(),
-                            tableHandle.getLimit());
+                            tableHandle.getLimit(),
+                            userIdentity,
+                            tableHandle.getDatasetVersion());
         }
         catch (RuntimeException e) {
             // Handle concurrent modification errors (e.g., fragment not found due to concurrent update)
@@ -101,13 +101,6 @@ private static boolean isConcurrentModificationError(RuntimeException e)
         return false;
     }
 
-    @VisibleForTesting
-    public static List<LanceColumnHandle> toColumnHandles(LanceTableHandle tableHandle, Map<String, String> storageOptions)
-    {
-        return LanceDatasetCache.getColumnHandles(tableHandle.getTablePath(), storageOptions).values().stream()
-                .map(c -> (LanceColumnHandle) c).collect(Collectors.toList());
-    }
-
     @Override
     public long getCompletedBytes()
     {
 
@@ -178,7 +178,7 @@ public long getBtreeIndexedRowsPerSplit()
         return btreeIndexedRowsPerSplit;
     }
 
-    @Config("lance.btree_indexed_rows_per_split")
+    @Config("lance.index.btree.rows_per_split")
     @ConfigDescription("Row count threshold for grouping btree-indexed fragments per split (default 100M)")
     public LanceConfig setBtreeIndexedRowsPerSplit(long btreeIndexedRowsPerSplit)
     {
@@ -191,11 +191,98 @@ public long getBitmapIndexedRowsPerSplit()
         return bitmapIndexedRowsPerSplit;
     }
 
-    @Config("lance.bitmap_indexed_rows_per_split")
+    @Config("lance.index.bitmap.rows_per_split")
     @ConfigDescription("Row count threshold for grouping bitmap-indexed fragments per split (default 10M)")
     public LanceConfig setBitmapIndexedRowsPerSplit(long bitmapIndexedRowsPerSplit)
     {
         this.bitmapIndexedRowsPerSplit = bitmapIndexedRowsPerSplit;
         return this;
     }
+
+    // ===== Cache Configuration =====
+
+    private int cacheSessionMaxEntries = 100;
+    private int cacheSessionTtlMinutes = 60;
+    private Long cacheSessionIndexCacheSizeBytes;  // null = use Lance default
+    private Long cacheSessionMetadataCacheSizeBytes;  // null = use Lance default
+    private int cacheDatasetMaxEntries = 100;
+    private int cacheDatasetTtlMinutes = 30;
+
+    public int getCacheSessionMaxEntries()
+    {
+        return cacheSessionMaxEntries;
+    }
+
+    @Config("lance.cache.session.max_entries")
+    @ConfigDescription("Maximum number of cached sessions (default 100)")
+    public LanceConfig setCacheSessionMaxEntries(int cacheSessionMaxEntries)
+    {
+        this.cacheSessionMaxEntries = cacheSessionMaxEntries;
+        return this;
+    }
+
+    public int getCacheSessionTtlMinutes()
+    {
+        return cacheSessionTtlMinutes;
+    }
+
+    @Config("lance.cache.session.ttl_minutes")
+    @ConfigDescription("Session cache TTL in minutes (default 60)")
+    public LanceConfig setCacheSessionTtlMinutes(int cacheSessionTtlMinutes)
+    {
+        this.cacheSessionTtlMinutes = cacheSessionTtlMinutes;
+        return this;
+    }
+
+    public Long getCacheSessionIndexCacheSizeBytes()
+    {
+        return cacheSessionIndexCacheSizeBytes;
+    }
+
+    @Config("lance.cache.session.index_cache_size_bytes")
+    @ConfigDescription("Lance session index cache size in bytes (default: Lance default)")
+    public LanceConfig setCacheSessionIndexCacheSizeBytes(Long cacheSessionIndexCacheSizeBytes)
+    {
+        this.cacheSessionIndexCacheSizeBytes = cacheSessionIndexCacheSizeBytes;
+        return this;
+    }
+
+    public Long getCacheSessionMetadataCacheSizeBytes()
+    {
+        return cacheSessionMetadataCacheSizeBytes;
+    }
+
+    @Config("lance.cache.session.metadata_cache_size_bytes")
+    @ConfigDescription("Lance session metadata cache size in bytes (default: Lance default)")
+    public LanceConfig setCacheSessionMetadataCacheSizeBytes(Long cacheSessionMetadataCacheSizeBytes)
+    {
+        this.cacheSessionMetadataCacheSizeBytes = cacheSessionMetadataCacheSizeBytes;
+        return this;
+    }
+
+    public int getCacheDatasetMaxEntries()
+    {
+        return cacheDatasetMaxEntries;
+    }
+
+    @Config("lance.cache.dataset.max_entries")
+    @ConfigDescription("Maximum number of cached datasets (default 100)")
+    public LanceConfig setCacheDatasetMaxEntries(int cacheDatasetMaxEntries)
+    {
+        this.cacheDatasetMaxEntries = cacheDatasetMaxEntries;
+        return this;
+    }
+
+    public int getCacheDatasetTtlMinutes()
+    {
+        return cacheDatasetTtlMinutes;
+    }
+
+    @Config("lance.cache.dataset.ttl_minutes")
+    @ConfigDescription("Dataset cache TTL in minutes (default 30)")
+    public LanceConfig setCacheDatasetTtlMinutes(int cacheDatasetTtlMinutes)
+    {
+        this.cacheDatasetTtlMinutes = cacheDatasetTtlMinutes;
+        return this;
+    }
 }