Share via


Databricks Runtime 18.2 (Beta)

The following release notes provide information about Databricks Runtime 18.2 (Beta).

This version incorporates all features, improvements, and bug fixes from all previous Databricks Runtime releases. Databricks released this version in April 2026.

Important

Databricks Runtime 18.2 is in Beta. The contents of the supported environments might change during the Beta. Changes can include the list of packages or versions of installed packages.

Behavioral changes

XPath no longer fetches external DTDs

When you evaluate XPath over XML, Azure Databricks no longer loads external Document Type Definitions (DTDs) declared in the document. Previously, XPath could fail when the XML contained an external DTD reference that pointed to a malformed URL or an unreachable endpoint. Because DTD validation is separate from XPath evaluation, queries that already succeeded return the same results as before. Queries that previously failed only during external DTD retrieval can now succeed.

NULL struct preservation in INSERT, MERGE, and streaming writes with schema evolution

For INSERT, MERGE, and streaming writes that use schema evolution, a NULL struct in the source is now stored as NULL in the target. Previously, that value was incorrectly materialized as a non-null struct with every field set to NULL, while the same operations without schema evolution preserved NULL structs correctly. If your code relied on receiving a non-null struct whose fields were all NULL, update your code to handle a NULL struct instead.

NullType (VOID) support in Delta tables

Delta tables now support VOID (Spark's NullType) columns. VOID columns are no longer dropped from the table schema upon reads. Writes are unaffected. See VOID type for restrictions on where VOID columns can appear in the schema.

SHOW CREATE TABLE supports metric views

SHOW CREATE TABLE now supports metric views. Previously, running this command on a metric view raised an error. The output for metric views includes the fully qualified three-part name with catalog (for example, CREATE VIEW catalog.db.my_metric_view ...), making it easier to recreate the metric view in the correct location.

Fix for LEFT OUTER JOIN LATERAL dropping rows

A bug that incorrectly dropped rows from LEFT OUTER JOIN LATERAL queries is now fixed. Queries using this construct now return the correct results. To temporarily revert to the previous behavior, set spark.databricks.sql.optimizer.lateralJoinPreserveOuterSemantic to true.

NATURAL JOIN respects case-insensitive column matching

NATURAL JOIN now correctly uses case-insensitive column matching when spark.sql.caseSensitive is set to false (the default). Previously, NATURAL JOIN used case-sensitive comparison to identify common columns, causing columns that differed only in case (for example, ID versus id) to not be recognized as matching. This caused NATURAL JOIN to silently produce cross-join results. This fix aligns NATURAL JOIN behavior with USING joins, which already handled case-insensitivity correctly. Queries affected by this bug now return correct results with properly joined columns.

SQL UDF dependency validation in Unity Catalog

Unity Catalog now enforces dependency validation for SQL user-defined functions (UDFs) to prevent access control bypass. Previously, SQL functions created through the REST API could reference dependencies the user did not have access to. SQL UDFs with invalid dependency configurations are now blocked from execution.

Optimized writes for partitioned Unity Catalog tables created with CRTAS

Optimized writes are now correctly applied to partitioned Unity Catalog tables created with CREATE OR REPLACE TABLE ... AS SELECT (CRTAS). Previously, CRTAS on new partitioned Unity Catalog tables did not apply optimized writes, resulting in a higher number of small files per partition. This fix can increase write latency. To revert to the previous behavior, set spark.databricks.delta.optimizeWrite.UCTableCRTAS.enabled to false.

AWS SDK v1 dependencies are shaded

AWS SDK v1 dependencies bundled with Databricks Runtime are now shaded and no longer directly available on the classpath. If your code depends on AWS SDK v1 libraries previously provided by Databricks Runtime, add them as explicit dependencies in your project. This change prepares for the migration to AWS SDK v2, following the end of AWS support for SDK v1.

Fix incorrect EPSG authority for ESRI-defined SRID 102100

The Coordinate Reference System (CRS) mapping for SRID 102100 now correctly uses ESRI:102100 instead of the incorrect EPSG:102100. This fix ensures geospatial data is stored with the correct authority for better interoperability with other systems.

New features and improvements

CREATE OR REPLACE support for temporary tables

CREATE OR REPLACE TEMP TABLE syntax is now supported, allowing you to create or replace temporary tables in a single statement. This eliminates the need to explicitly drop and recreate temporary tables.

agg() alias for measure() function

agg() is now available as an alias for the measure() function. This change is fully backward compatible. Existing queries that use measure() continue to work without modification, and agg() produces identical results when used with the same arguments.

Snowflake JDBC driver upgrade

The Snowflake JDBC driver is upgraded from 3.22.0 to 3.28.0.

pyspark.pipelines.testing namespace alias

pyspark.pipelines.testing is now available as a convenience alias for dlt.testing APIs. Import Lakeflow Spark Declarative Pipelines pipeline testing utilities through either namespace.

Improved Auto Loader listing performance

Auto Loader now uses a more efficient listing method that improves listing speed for cloud storage sources. If your stream triggers overlap due to long-running listing operations, this optimization can result in increased cloud listing API costs. Monitor your trigger intervals and adjust scheduling to prevent overlapping operations if cost increases are observed.

Delta table history includes write option flags

Delta table history (DESCRIBE HISTORY) now includes write option flags in the operationParameters column for WRITE and REPLACE TABLE operations. When the following options are explicitly enabled, they appear as boolean flags in the history (only included when true):

For WRITE and REPLACE TABLE operations:

  • isDynamicPartitionOverwrite: present when dynamic partition overwrite mode was used
  • canOverwriteSchema: present when schema overwrite (overwriteSchema) was enabled
  • canMergeSchema: present when schema merge (mergeSchema) was enabled

For REPLACE TABLE operations:

  • predicate: present when replaceWhere was used
  • isV1WriterSaveAsTableOverwrite: present when the replace was triggered by a .saveAsTable overwrite

Structured Streaming rewind and replay support

Structured Streaming now supports rewind and replay for streaming pipelines. This feature enables reprocessing from an earlier point in the stream to recover from failures such as schema changes, malformed input data, or logic errors, without requiring a full state reset. This doesn't modify the default behavior of existing streaming workloads.

Library upgrades

  • Upgraded Python libraries:

    No libraries were upgraded in this version.

  • Upgraded R libraries:

    No libraries were upgraded in this version.

  • Upgraded Java libraries:

    • io.delta.delta-sharing-client_2.13 from 1.3.9 to 1.3.10

Apache Spark

Databricks Runtime 18.2 includes Apache Spark 4.1.0. This release includes all Spark fixes and improvements included in Databricks Runtime 18.1, as well as the following additional bug fixes and improvements made to Spark:

  • SPARK-56219 Revert "[SC-225028][PS] Align groupby idxmax and idxmin skipna=False behavior with pandas 2/3"
  • SPARK-56204 Strip Alias wrappers from inline table row expressions in parser
  • SPARK-56186 Retire pypy
  • SPARK-56202 Refactor streaming join tests: split Base/Suite hierarchy and simplify mode dispatch
  • SPARK-56221 Feature parity between spark.catalog.* vs DDL commands
  • SPARK-56301 Fix typos in error-conditions.json
  • SPARK-55729 Support state data source reader for new state format v4 on stream-stream join
  • SPARK-56256 Add emptyDataFrame API to SparkSession
  • SPARK-56205 Validate base state store checkpoint ID before committing microbatch
  • SPARK-55827 Fix type hint for datasource workers
  • SPARK-55579 Rename PySpark error classes to be eval-type-agnostic
  • SPARK-56247 Fix the fall back behavior and type hint of inheritable_thread_target
  • SPARK-56244 Refine benchmark class layout in bench_eval_type.py
  • SPARK-56262 Remove the unnecessary mypy check disable for types
  • SPARK-55969 regr_r2 should treat first param as dependent variable
  • SPARK-56179 Consolidate error classes for type mismatch - part 3
  • SPARK-55630 Skip updating matched flag for non-outer side in stream-stream join v4
  • SPARK-56217 Fix bucketBy exceptions in connect
  • SPARK-56225 Improve View WITH SCHEMA EVOLUTION error message
  • SPARK-55865 Rename _LEGACY_ERROR_TEMP_1266 to CANNOT_TRUNCATE_EXTERNAL_TABLE
  • SPARK-55861 Rename _LEGACY_ERROR_TEMP_2045 to UNSUPPORTED_TABLE_CHANGE
  • SPARK-56166 Use ArrowBatchTransformer.enforce_schema to replace column-wise type coercion logic
  • SPARK-56245 Fix DataFrame.eval inplace assignment on pandas 3
  • SPARK-56062 Isolate memory_profiler to improve import time
  • SPARK-55964 system catalog wins over user catalog for BUILTIN, and SESSION schemas.
  • SPARK-56226 Catch analysis errors before InternalFrame.__init__ in .loc
  • SPARK-55723 Generalize enforce_schema error to PySparkTypeError
  • SPARK-54878 Add sortKeys option to to_json function
  • SPARK-56219 Align groupby idxmax and idxmin skipna=False behavior with pandas 2/3
  • SPARK-44065 Optimize BroadcastHashJoin skew in OptimizeSkewedJoin
  • SPARK-56179 Revert "[SC-225014][PYTHON] Consolidate error classes for type mismatch - part 3"
  • SPARK-53399 Merge Python UDFs
  • SPARK-56224 Polish type annotations for accumulators.py
  • SPARK-55448 Fix query events loss when session closes during query execution
  • SPARK-55862 Rename _LEGACY_ERROR_TEMP_2027 to UNEXPECTED_OPERATOR_IN_CORRELATED_SUBQUERY
  • SPARK-56201 Run SPARK-49829 tests with VCF joins now that StateDataSource supports it
  • SPARK-56179 Consolidate error classes for type mismatch - part 3
  • SPARK-56184 Replace assert with proper SparkRuntimeException in partition column parsing
  • SPARK-56206 Fix case-insensitive duplicate CTE name detection
  • SPARK-55866 Rename _LEGACY_ERROR_TEMP_2145 to OPTION_VALUE_EXCEEDS_ONE_CHARACTER
  • SPARK-56067 Lazy import psutil to improve import speed
  • SPARK-56066 Lazy import numpy to improve import speed
  • SPARK-55719 Remove deprecation warning for spark.sql.hive.convertCTAS
  • SPARK-56179 Consolidate error classes for type mismatch - part 2
  • SPARK-55510 Update structured-streaming-state-data-source.md doc to reflect deleteRange
  • SPARK-56050 Eagerly resolve IDENTIFIER() with string literals at parse time
  • SPARK-56151 Improve CreateVariable display string
  • SPARK-55751 Add metrics on state store loads from DFS
  • SPARK-56188 Align Series.map({}) with pandas 3 empty-dict behavior
  • SPARK-55964 Revert "[SC-223957] system catalog wins over user catalog for BUILTIN, and SESSION schemas."
  • SPARK-55577 Refactor SQL_SCALAR_ARROW_ITER_UDF wrapper, mapper, and serializer logic
  • SPARK-55596 DSV2 Enhanced Partition Stats Filtering
  • SPARK-56179 Revert "[SC-224777][PYTHON] Consolidate error classes for type mismatch - part 2"
  • SPARK-55964 system catalog wins over user catalog for BUILTIN, and SESSION schemas.
  • SPARK-56050 Revert "[SC-224153][SQL] Eagerly resolve IDENTIFIER() with string literals at parse time"
  • SPARK-56179 Consolidate error classes for type mismatch - part 2
  • SPARK-56102 UnionEstimation code cleanup
  • SPARK-51712 Swallow non-fatal Throwables when resolving tables/views in spark.catalog.listTables()
  • SPARK-55881 Add queryId, errorMessage, and rootExecutionId to SQL execution REST API
  • SPARK-56050 Eagerly resolve IDENTIFIER() with string literals at parse time
  • SPARK-55628 Integrate stream-stream join state format V4
  • SPARK-56187 Fix Series.argsort null ordering for pandas 3
  • SPARK-56167 Align astype with pandas 3 default string behavior
  • SPARK-56018 Use ruff as formatter
  • SPARK-56042 Fix swapped external/internal col family count metrics in RocksDBStateStoreProvider
  • SPARK-56179 Consolidate error classes for type mismatch - part 1
  • SPARK-56089 Align asinh/acosh with fdlibm algorithm for cross-engine compatibility
  • SPARK-55453 Fix LIKE pattern matching for supplementary Unicode characters
  • SPARK-52785 Simplifying super() syntax in PySpark
  • SPARK-56169 Fix ClassCastException in error reporting when GetStructField child type is changed by plan transformation
  • SPARK-55557 Hyperbolic functions should not overflow with large inputs
  • SPARK-47997 Add errors parameter to DataFrame.drop and Series.drop
  • SPARK-55008 Display Query ID in SparkUI
  • SPARK-54660 Add RTM trigger to python
  • SPARK-56047 Propagate distinctCount through Union in CBO statistics estimation
  • SPARK-56111 Add SparkContext.isDriver() and use it across the codebase
  • SPARK-55999 Enable forceSnapshotUploadOnLag by default
  • SPARK-55610 Add getExecutorInfos to StatusTracker in Python
  • SPARK-55728 Introduce conf for file checksum threadpool size and support disabling the threadpool
  • SPARK-55686 SizeEstimator takes care of Compact Object Headers
  • SPARK-56044 HistoryServerDiskManager does not delete app store on release when app is not in active map
  • SPARK-55809 HeapHistogram uses DiagnosticCommandMBean instead of jmap subprocess
  • SPARK-56122 Use pandas-aware numeric dtype check in Series.cov
  • SPARK-56113 Improve pandas 3 string restoration in pandas-on-Spark
  • SPARK-56118 Match pandas 3.0 bool handling in GroupBy.quantile
  • SPARK-53823 Implement allow list for real time mode
  • SPARK-55977 Fix isin() to use strict type matching like pandas
  • SPARK-54027 Kafka Source RTM support
  • SPARK-50284 Change docs for parseJson function
  • SPARK-56035 [SQL] Introduce AggregationValidator for single-pass resolver Aggregate validation
  • SPARK-55557 Revert "[SC-223720][SQL] Hyperbolic functions should not overflow with large inputs"
  • SPARK-56075 Remove a batch of dead python error classes
  • SPARK-55967 Unify column conversion for connect dataframe
  • SPARK-53915 Add RealTimeScanExec and ability to execute long running batches
  • SPARK-55557 Hyperbolic functions should not overflow with large inputs
  • SPARK-55147 Scope timestamp range for time-interval join retrieval in V4 state format
  • SPARK-56056 Support simpler worker profiling with viztracer
  • SPARK-55948 Add DSv2 CDC connector API, analyzer resolution, and SQL CHANGES clause
  • SPARK-54599 Reapply "[SC-219008][PYTHON] Refactor PythonExcept…
  • SPARK-55390 Consolidate SQL_SCALAR_ARROW_UDF wrapper, mapper, and serializer logic
  • SPARK-56023 Better load balance in LowLatencyMemoryStream
  • SPARK-55986 Upgrade black to 26.3.1
  • SPARK-55667 Move check_dependencies to init
  • SPARK-55145 Support Avro for timestamp based RocksDB state key encoders
  • SPARK-53970 Remove incorrect 'optional' tag for messageName…
  • SPARK-55059 Revert "[SC-224058][PYTHON] Remove empty table workaround in toPandas"
  • SPARK-50111 Add subplots support for pie charts in Plotly backend
  • SPARK-56081 Align idxmax and idxmin NA handling with pandas 3
  • SPARK-56080 Align Series.argmax/argmin with pandas 3.0 NA handling
  • SPARK-56060 Handle pandas 3 null string conversion in describe() for empty timestamp frames
  • SPARK-55059 Remove empty table workaround in toPandas
  • SPARK-55995 Support TIMESTAMP WITH LOCAL TIME ZONE in SQL syntax
  • SPARK-55976 Use Set instead of Seq for write privileges
  • SPARK-56073 Simplify the build of PythonRunnerConfMap
  • SPARK-55887 Special handling for CollectLimitExec/CollectTailExec to avoid full table scans
  • SPARK-55980 Always apply _cast_back_float in numeric arithmetic
  • SPARK-55357 Fix docstring for timestamp_add
  • SPARK-55667 Revert "[SC-223289][PYTHON][CONNECT] Move check_dependencies to init"
  • SPARK-54285 Revert "[PYTHON] Cache timezone info to avoid expensive timestamp conversion"
  • SPARK-56021 Increase AutoSnapshotRepair default maxChangeFileReplay threshold from 50 to 500
  • SPARK-55870 Add docs for Geo types
  • SPARK-55962 Use getShort instead of getInt casting in putShortsFromIntsLittleEndian on Little Endian platforms
  • SPARK-55903 Simplify MERGE Schema Evolution and Check Write Privileges
  • SPARK-55326 Release remote session when SPARK_CONNECT_RELEASE_SESSION_ON_EXIT is set
  • SPARK-55667 Move check_dependencies to init
  • SPARK-55884 Add v1StatsToV2Stats to DataSourceV2Relation
  • SPARK-55929 Add missing toString() to TableChange.UpdateColumnDefaultValue
  • SPARK-55851 Clarify types of datasource partition and read
  • SPARK-55828 Add DSV2 TableChange toString and fix missing error class for Merge Into Schema Evolution
  • SPARK-55790 Build a complete SRS registry using PROJ 9.7.1 data
  • SPARK-55645 Add serdeName to CatalogStorageFormat
  • SPARK-54796 Fix NPE caused by race condition between Executor initialization and shuffle migration
  • SPARK-55983 New single-pass analyzer functionality and bugfixes
  • SPARK-55964 Cache coherence: clear function registry on DROP DATABASE
  • SPARK-55868 Fix Predicate Pushdown for InMemoryTable for V2Filters
  • SPARK-55973 LeftSemi optimization for stream-stream join
  • SPARK-54665 Fix boolean vs string comparison to match pandas behavior
  • SPARK-55539 Allow casting from GeographyType to GeometryType
  • SPARK-55695 Avoid double planning in row-level operations
  • SPARK-55904 Utilize _check_same_session to narrow down types
  • SPARK-55965 Add warning when pandas >= 3.0.0 is used with PySpark
  • SPARK-55493 [SS] Do not mkdirs in streaming checkpoint offset/commit log directory in StateDataSource
  • SPARK-55851 Revert "[SC-223270][PYTHON] Clarify types of datasource partition and read"
  • SPARK-55645 Revert "[SC-221839][SQL] Add serdeName to CatalogStorageFormat"
  • SPARK-55640 Propagate WKB parsing errors for Geometry and Geography
  • SPARK-55693 Avoid deadlock by making SparkSession.observationManager a non-lazy val
  • SPARK-55528 Add default collation support for SQL UDFs
  • SPARK-55860 Use UNABLE_TO_INFER_SCHEMA instead of UNABLE_TO_INFER_SCHEMA_FOR_DATA_SOURCE
  • SPARK-55275 Add InvalidPlanInput sql states for sql/connect
  • SPARK-55645 Add serdeName to CatalogStorageFormat
  • SPARK-55716 Support NOT NULL constraint enforcement for V1 file source table inserts
  • SPARK-53226 Make ClosureCleaner work with Java22+
  • SPARK-55997 Set upper bound to prefixScan in RocksDB state store provider
  • SPARK-55851 Clarify types of datasource partition and read
  • SPARK-55954 Remove the incorrect overload type hint for fillna
  • SPARK-56016 Preserve named Series columns in concat with ignore_index on pandas 3
  • SPARK-55502 Unify UDF and UDTF Arrow conversion error handling
  • SPARK-55989 Preserve non-int64 index dtypes in restore_index
  • SPARK-55955 Remove overload type hint for drop
  • SPARK-55945 [SDP] Support structured identifiers for flows in SDP eager analysis protos
  • SPARK-55714 JDK might throw ArithmeticException without message
  • SPARK-55991 Fix unicode related SQL text corruption with parameters
  • SPARK-55696 Add explicit error to Encoders.bean for interface class
  • SPARK-55533 Support IGNORE NULLS / RESPECT NULLS for collect_set
  • SPARK-55987 Fix V4 windowed join timestamp extraction using findJoinKeyOrdinalForWatermark
  • SPARK-55946 Set up pandas_priority so mixed binary ops dispatch correctly to pandas-on-Spark
  • SPARK-55264 Add ExecuteOutput command to Spark Connect pipelines proto
  • SPARK-47672 Avoid double eval from filter pushDown w/ projection pushdown
  • SPARK-55780 Replace PNG logo with SVG in Spark Web UI
  • SPARK-55821 Enforce keyword-only arguments in serializer init methods
  • SPARK-55621 Fix ambiguous and unnecessary unicode usage
  • SPARK-55662 Implementation of idxmin Axis argument
  • SPARK-55631 ALTER TABLE must invalidate cache for DSv2 tables
  • SPARK-55692 Fix SupportsRuntimeFiltering and SupportsRuntimeV2Filtering documentation
  • SPARK-55928 New linter for config effectiveness in views and UDFs
  • SPARK-55440 Types Framework - Phase 1a - Core Type System Foundation
  • SPARK-55631 Revert "[SC-221596][SQL] ALTER TABLE must invalidate cache for DSv2 tables"
  • SPARK-55631 ALTER TABLE must invalidate cache for DSv2 tables
  • SPARK-55683 Optimize VectorizedPlainValuesReader.readUnsignedLongs
  • SPARK-55892 Fix unable to load state store because reused SST file was deleted by maintenance
  • SPARK-55946 Revert "[SC-223027][PS] Set up pandas_priority so mixed binary ops dispatch correctly to pandas-on-Spark"
  • SPARK-55891 Preserve the SQL scripting context inside EXECUTE IMMEDIATE
  • SPARK-55907 Fix incorrect error positions for invalid data types in CREATE FUNCTION
  • SPARK-55946 Set up pandas_priority so mixed binary ops dispatch correctly to pandas-on-Spark
  • SPARK-55694 Block constraints in CTAS/RTAS at parser level
  • SPARK-55682 ServiceLoader returned iterator may throw NoClassDefFoundError on hasNext()
  • SPARK-55155 Fix SET CATALOG to use special chars and backticks in the identifier name
  • SPARK-55932 Fix XML to variant parser hang on negative scale
  • SPARK-55673 Add more tests for nested type encoder
  • SPARK-55679 Fix dectecting sun.io.serialization.extendedDebugInfo on Java 25
  • SPARK-55957 Add 'DATA_SOURCE_NOT_FOUND' in Catalog.ERROR_HANDLING_RULES
  • SPARK-55052 Add AQEShuffleRead properties to Physical Plan Tree
  • SPARK-55652 Optimize VectorizedPlainValuesReader.readShorts() with direct array access for heap buffers
  • SPARK-55659 Improve EventLogFileWriter to log stop operation
  • SPARK-54666 Leave numeric types unchanged on to_numeric
  • SPARK-55654 Enable TreePattern pruning for EliminateSubqueryAliases and ResolveInlineTables
  • SPARK-55533 Revert "[SC-220538][SQL] Support IGNORE NULLS / RESPECT NULLS for collect_set"
  • SPARK-55901 Raise an error from Series.replace() with no arguments
  • SPARK-55896 Use numpy functions instead of builtins
  • SPARK-55655 Make CountVectorizer vocabulary deterministic when counts are equal
  • SPARK-55811 [SQL] Catch NonFatal instead of UnresolvedException when calling nodeWithOutputColumnsString
  • SPARK-55533 Support IGNORE NULLS / RESPECT NULLS for collect_set
  • SPARK-55435 Use StringBuilder instead of StringBuffer
  • SPARK-54807 Allow qualified names for built-in and session functions (#198171)
  • SPARK-55854 Tag pass-through duplicate attributes in Expand output to prevent AMBIGUOUS_REFERENCE
  • SPARK-55261 Implement Parquet read support for Geo types
  • SPARK-55416 Streaming Python Data Source memory leak when end-offset is not updated
  • SPARK-55465 Support GeometryType in convert_numpy
  • SPARK-55801 Fix type hint of _SimpleStreamReaderWrapper.getCache
  • SPARK-55800 Remove the unused type check for datetime.date
  • SPARK-55663 Unify module for data source functions
  • SPARK-55665 Unify how workers establish connection with the executor
  • SPARK-53446 Optimize BlockManager remove operations with cached block mappings
  • SPARK-55867 Fix StringMethods with pandas 3
  • SPARK-55501 Fix listagg distinct + within group order by bug
  • SPARK-55558 Add support for Tuple/Theta set operations
  • SPARK-55636 Add detailed errors in case of deduplication of invalid columns
  • SPARK-55788 Support ExtensionDType for integers in Pandas UDF
  • SPARK-55464 Support GeographyType in convert_numpy
  • SPARK-55530 Support Geo result sets in Hive and Thrift server
  • SPARK-55525 Fix UDTF_ARROW_TYPE_CONVERSION_ERROR with undefined error message parameter
  • SPARK-55626 Don't load metadata columns on Table unless needed in V2TableUtil
  • SPARK-55533 Revert "[SC-220538][SQL] Support IGNORE NULLS / RESPECT NULLS for collect_set"
  • SPARK-55435 Revert "[SC-219656][CORE][SQL] Use StringBuilder instead of StringBuffer"
  • SPARK-55533 Support IGNORE NULLS / RESPECT NULLS for collect_set
  • SPARK-54452 Fix empty response from SparkConnect server for spark.sql(...) inside FlowFunction
  • SPARK-55638 Refactor WKT serialization in GeometryModel
  • SPARK-55551 Improve BroadcastHashJoinExec output partitioning
  • SPARK-54314 Improve Server-Side debuggability in Spark Connect by capturing client application's file name and line numbers
  • SPARK-55517 Optimize VectorizedPlainValuesReader.readBytes() with direct array access for heap buffers
  • SPARK-55495 Fix EventLogFileWriters.closeWriter to handle checkError
  • SPARK-55279 Add sketch_funcs group for DataSketches SQL functions
  • SPARK-55435 Use StringBuilder instead of StringBuffer
  • SPARK-55064 Support query level indeterminate shuffle retry
  • SPARK-55411 SPJ may throw ArrayIndexOutOfBoundsException when join keys are less than cluster keys
  • SPARK-55451 Cursors must start collecting results on OPEN, not first FETCH
  • SPARK-54687 Add more edge cases with generators
  • SPARK-55691 GetStatus client
  • SPARK-55277 Add protobuf_funcs group for Protobuf SQL functions
  • SPARK-55822 Rename _LEGACY_ERROR_TEMP_0052 to CREATE_VIEW_WITH_IF_NOT_EXISTS_AND_REPLACE
  • SPARK-55236 Address unexpected exception in some CoarseGrainedExecutorBackendSuite test cases
  • SPARK-55275 SQL State Coverage: IllegalStateException
  • SPARK-55462 Reapply "[SC-221123][PYTHON] Support VariantType in convert_numpy"
  • SPARK-55062 Support proto2 extensions in protobuf functions
  • SPARK-55248 Clean up Jackson deprecated API usage in streaming.checkpointing.Checksum
  • SPARK-55250 Reduce Hive client calls on CREATE NAMESPACE
  • SPARK-55247 Clean up deprecated API usage related to o.a.c.io.input.BoundedInputStream
  • SPARK-55198 spark-sql should skip comment line with leading whitespaces
  • SPARK-55826 Rename _LEGACY_ERROR_TEMP_0006 to MERGE_INSERT_VALUE_COUNT_MISMATCH
  • SPARK-55127 Add avro_funcs group for Avro SQL functions
  • SPARK-54914 [SQL] Fixing DROP operator in pipe syntax to support qualified column names
  • SPARK-55113 EnsureRequirements should copy tags
  • SPARK-55074 Add test for Merge Into ANSI type coercion
  • SPARK-54217 Synchronize PythonRunner's MonitorThread kill decision
  • SPARK-54374 Enlarge the SVG viewBox attribute of SQL plan visualization initialization
  • SPARK-54971 Add WITH SCHEMA EVOLUTION syntax for SQL INSERT
  • SPARK-55065 Avoid making two JDBC API calls
  • SPARK-55033 Fix stringArgs of DSv2 writing commands
  • SPARK-55041 Cleanup some unused private funcion/val from core module
  • SPARK-55338 Centralize Spark Connect request decompression logic in gRPC interceptor
  • SPARK-55825 Rename _LEGACY_ERROR_TEMP_1309 to PARTITION_BY_NOT_ALLOWED_WITH_INSERT_INTO
  • SPARK-55492 Validate that eventTime in withWatermark is top-level column
  • SPARK-55802 Fix integer overflow when computing Arrow batch bytes
  • SPARK-55694 Block constraints in CTAS/RTAS at parser level
  • SPARK-55843 Handle the unit of datetime64 and timedelta64 dtypes
  • SPARK-55824 Rename _LEGACY_ERROR_TEMP_1034 to WINDOW_FUNCTION_NOT_ALLOWED_IN_CLAUSE
  • SPARK-55819 Refactor ExpandExec to be more succinct
  • SPARK-55341 Add storage level flag for cached local relations
  • SPARK-54599 Revert "[SC-219008][PYTHON] Refactor PythonException so it can take errorClass with sqlstate"
  • SPARK-46167 Add axis implementation to DataFrame.rank
  • SPARK-54599 Refactor PythonException so it can take errorClass with sqlstate
  • SPARK-55529 Reapply [ES-1721989][SC-220716][PYTHON] Restore Arrow-level batch merge for non-iterator applyInPandas
  • SPARK-55794 Always alias OuterReferences
  • SPARK-55583 Validate Arrow schema types in Python data source
  • SPARK-37711 Reduce pandas describe job count from O(N) to O(1)
  • SPARK-46168 Add axis argument for idxmax
  • SPARK-46162 Implement nunique with axis=1
  • SPARK-55552 Add VariantType support to ColumnarBatchRow.copy() and MutableColumnarRow
  • SPARK-55647 Fix ConstantPropagation incorrectly replacing attributes with non-binary-stable collations
  • SPARK-55747 Fix NPE when accessing elements from an array that is null
  • SPARK-55757 Improve spark.task.cpus validation
  • SPARK-55699 Inconsistent reading of LowLatencyClock when used together with ManualClock
  • SPARK-55702 Support filter predicate in window aggregate functions
  • SPARK-55510 Fix deleteRange of Rocksdb state store to call changelogWriter
  • SPARK-55739 Optimize OnHeapColumnVector.putIntsLittleEndian/putLongsLittleEndian using Platform.copyMemory on little-endian platforms
  • SPARK-55730 Not make timezone lower case
  • SPARK-55701 ES-1694761[SS] Fix race condition in CompactibleFileStreamLog.allFiles
  • SPARK-55462 Revert "[SC-221123][PYTHON] Support VariantType in convert_numpy"
  • SPARK-55144 Introduce new state format version for performant stream-stream join
  • SPARK-55606 Server-side implementation of GetStatus API
  • SPARK-55462 Support VariantType in convert_numpy
  • SPARK-55600 Fix pandas to arrow loses row count when schema has 0 columns on classic
  • SPARK-55700 Fix handling integer keys on Series with non-integer index
  • SPARK-55349 Consolidate pandas-to-Arrow conversion utilities in serializers
  • SPARK-55681 Fix singleton DataType equality after deserialization (reattempt)
  • SPARK-55681 Revert "[SC-221427][SC-214079][SQL] Fix singleton DataType equality after deserialization"
  • SPARK-55681 Fix singleton DataType equality after deserialization
  • SPARK-55674 Optimize 0-column table conversion in Spark Connect
  • SPARK-55323 Reapply "[SC-218885][PYTHON] Move UDF metadata to EvalConf to simplify worker protocol"
  • SPARK-55322 Reapply [SC-221062][SQL] MaxBy and MinBy Overload with K Elements
  • SPARK-55323 Revert "[SC-218885][PYTHON] Move UDF metadata to EvalConf to simplify worker protocol"
  • SPARK-55615 Move SparkContext import into class branch
  • SPARK-55323 Move UDF metadata to EvalConf to simplify worker protocol
  • SPARK-55648 Handle an unexpected keyword argument error groupby(axis) with pandas 3
  • SPARK-55647 Revert "[SC-221274][SQL] Fix ConstantPropagation incorrectly replacing attributes with non-binary-stable collations"
  • SPARK-55646 Refactored SQLExecution.withThreadLocalCaptured to separate thread-local capture from execution
  • SPARK-54854 Add a UUIDv7 queryId to SQLExecution Events
  • SPARK-55619 Fix custom metrics in case of coalesced partitions
  • SPARK-55647 Fix ConstantPropagation incorrectly replacing attributes with non-binary-stable collations
  • SPARK-55322 Revert "[SC-221062][SQL] MaxBy and MinBy Overload with K Elements"
  • SPARK-54740 Start faulthandler early in daemon mode
  • SPARK-55493 [SS] Do not mkdirs in streaming checkpoint state directory in StateDataSource
  • SPARK-55322 MaxBy and MinBy Overload with K Elements
  • SPARK-55625 Fix StringOps to make str dtype work properly
  • SPARK-55161 Reapply "[SC-218867][PYTHON] Support profilers on python data source"
  • SPARK-55505 Fix NPE on reading EXECUTION_ROOT_ID_KEY in concurrent scenarios
  • SPARK-55111 Recheckin Unfinished repartitioning detection on query restart
  • SPARK-55593 Unify aggregation state for vector_avg/vector_sum
  • SPARK-55500 Fix analyzer cycle between ApplyDefaultCollation, ExtractWindowExpressions and CollationTypeCasts
  • SPARK-55494 Introduce iterator/prefixScan with multi-values in StateStore API
  • SPARK-55561 Add retries for all Kafka admin client methods
  • SPARK-55296 Support CoW mode with pandas 3
  • SPARK-55479 Fix style issues in SparkShreddingUtils
  • SPARK-55372 Fix SHOW CREATE TABLE for tables / views with default collation
  • SPARK-55333 Enable DateType and TimeType in convert_numpy
  • SPARK-55129 Introduce new key encoders for timestamp as a first class (UnsafeRow)
  • SPARK-46163 DataFrame.update parameters filter_func and errors
  • SPARK-55372 Revert "[SC-220571][SQL] Fix SHOW CREATE TABLE for tables / views with default collation"
  • SPARK-55480 Remove all unused noqa for ruff
  • SPARK-55471 Add optimizer support for SequentialStreamingUnion
  • SPARK-55584 Produce better error on scalar subquery to EXEC IMMEDIATE
  • SPARK-55161 Revert "[SC-218867][PYTHON] Support profilers on python data source"
  • SPARK-55506 Pass explicit input schema to to_pandas in CogroupPandasUDFSerializer
  • SPARK-55586 Add jdbc.py Example
  • SPARK-55161 Support profilers on python data source
  • SPARK-55529 Revert "[SC-220716][PYTHON] Restore Arrow-level batch merge for non-iterator applyInPandas"
  • SPARK-55385 Mitigate the recomputation in zipWithIndex
  • SPARK-55529 Restore Arrow-level batch merge for non-iterator applyInPandas
  • SPARK-55389 Consolidate SQL_MAP_ARROW_ITER_UDF wrapper, mapper, and serializer logic
  • SPARK-55406 Reimplement the thread pool for ExecutePlanResponseReattachableIterator
  • SPARK-55372 Fix SHOW CREATE TABLE for tables / views with default collation
  • SPARK-55367 Use venv for run-pip-tests
  • SPARK-55355 Upgrade mypy version to the latest
  • SPARK-55460 Remove E203 from ruff's ignore list
  • SPARK-55541 Support Geometry and Geography in catalyst type converters
  • SPARK-55449 Enable WKB parsing and writing for Geography
  • SPARK-55339 Implement WKT writer support for Geo objects
  • SPARK-54122 Implement TwsTester in Scala
  • SPARK-54805 Implement TwsTester in PySpark
  • SPARK-55256 Reapply "[SC-218596][SQL] Support IGNORE NULLS / RESPECT NULLS for array_agg and collect_list"
  • SPARK-55156 Deal with include_groups for groupby.apply
  • SPARK-55401 Add retry logic and timeout handling to pyspark install download
  • SPARK-55229 Implement DataFrame.zipWithIndex in PySpark
  • SPARK-55462 Support UserDefinedType in convert_numpy
  • SPARK-55483 Fix NPE in PivotFirst when pivot column is a non-atomic type with null values
  • SPARK-55490 Make groupby(as_index=False) include a grouping that is not in the DataFrame with pandas 3
  • SPARK-55473 Replace itertools.tee with chain in applyInPandasWithState
  • SPARK-55404 Always raise KeyboardInterrupt from SIGINT handler
  • SPARK-55407 Replace logger.warn with logger.warning

Databricks ODBC/JDBC driver support

Databricks supports ODBC/JDBC drivers released in the past 2 years. Please download the recently released drivers and upgrade (download ODBC, download JDBC).

System environment

  • Operating System: Ubuntu 24.04.4 LTS
  • Java: Zulu21.48+15-CA
  • Scala: 2.13.16
  • Python: 3.12.3
  • R: 4.5.1
  • Delta Lake: 4.1.0

Installed Python libraries

Library Version Library Version Library Version
aiohappyeyeballs 2.4.4 aiohttp 3.11.10 aiosignal 1.2.0
annotated-doc 0.0.4 annotated-types 0.7.0 anyio 4.7.0
argon2-cffi 21.3.0 argon2-cffi-bindings 21.2.0 arro3-core 0.6.5
arrow 1.3.0 asttokens 3.0.0 astunparse 1.6.3
async-lru 2.0.4 attrs 24.3.0 autocommand 2.2.2
azure-common 1.1.28 azure-core 1.37.0 azure-identity 1.20.0
azure-mgmt-core 1.6.0 azure-mgmt-web 8.0.0 azure-storage-blob 12.28.0
azure-storage-file-datalake 12.22.0 babel 2.16.0 backports.tarfile 1.2.0
beautifulsoup4 4.12.3 black 24.10.0 bleach 6.2.0
blinker 1.7.0 boto3 1.40.45 botocore 1.40.45
cachetools 5.5.1 certifi 2025.4.26 cffi 1.17.1
chardet 4.0.0 charset-normalizer 3.3.2 click 8.1.8
cloudpickle 3.0.0 comm 0.2.1 contourpy 1.3.1
cryptography 44.0.1 cycler 0.11.0 Cython 3.1.5
databricks-agents 1.9.1 databricks-sdk 0.67.0 dataclasses-json 0.6.7
dbus-python 1.3.2 debugpy 1.8.11 decorator 5.1.1
defusedxml 0.7.1 deltalake 1.1.4 Deprecated 1.2.18
distlib 0.3.9 docstring-to-markdown 0.11 executing 1.2.0
facets-overview 1.1.1 fastapi 0.128.0 fastjsonschema 2.21.1
filelock 3.17.0 fonttools 4.55.3 fqdn 1.5.1
frozenlist 1.5.0 fsspec 2023.5.0 gitdb 4.0.11
GitPython 3.1.43 google-api-core 2.28.1 google-auth 2.47.0
google-cloud-core 2.5.0 google-cloud-storage 3.7.0 google-crc32c 1.8.0
google-resumable-media 2.8.0 googleapis-common-protos 1.65.0 grpcio 1.67.0
grpcio-status 1.67.0 h11 0.16.0 hf-xet 1.2.0
httpcore 1.0.9 httplib2 0.20.4 httpx 0.28.1
huggingface_hub 1.2.4 idna 3.7 importlib_metadata 8.5.0
inflect 7.3.1 iniconfig 1.1.1 ipyflow-core 0.0.209
ipykernel 6.29.5 ipython 8.30.0 ipython-genutils 0.2.0
ipywidgets 7.8.1 isodate 0.7.2 isoduration 20.11.0
jaraco.collections 5.1.0 jaraco.context 5.3.0 jaraco.functools 4.0.1
jaraco.text 3.12.1 jedi 0.19.2 Jinja2 3.1.6
jiter 0.12.0 jmespath 1.0.1 joblib 1.4.2
json5 0.9.25 jsonpatch 1.33 jsonpointer 3.0.0
jsonschema 4.23.0 jsonschema-specifications 2023.7.1 jupyter-events 0.12.0
jupyter-lsp 2.2.5 jupyter_client 8.6.3 jupyter_core 5.7.2
jupyter_server 2.15.0 jupyter_server_terminals 0.5.3 jupyterlab 4.3.4
jupyterlab_pygments 0.3.0 jupyterlab_server 2.27.3 jupyterlab_widgets 1.1.11
kiwisolver 1.4.8 langchain-core 1.2.6 langchain-openai 1.1.6
langsmith 0.6.1 launchpadlib 1.11.0 lazr.restfulclient 0.14.6
lazr.uri 1.0.6 litellm 1.75.9 markdown-it-py 2.2.0
MarkupSafe 3.0.2 marshmallow 3.26.2 matplotlib 3.10.0
matplotlib-inline 0.1.7 mccabe 0.7.0 mdurl 0.1.0
mistune 3.1.2 mlflow-skinny 3.8.1 mmh3 5.2.0
more-itertools 10.3.0 msal 1.34.0 msal-extensions 1.3.1
multidict 6.1.0 mypy-extensions 1.0.0 nbclient 0.10.2
nbconvert 7.16.6 nbformat 5.10.4 nest-asyncio 1.6.0
nodeenv 1.10.0 notebook 7.3.2 notebook_shim 0.2.4
numpy 2.1.3 oauthlib 3.2.2 openai 2.14.0
opentelemetry-api 1.39.1 opentelemetry-proto 1.39.1 opentelemetry-sdk 1.39.1
opentelemetry-semantic-conventions 0.60b1 orjson 3.11.5 overrides 7.4.0
packaging 24.2 pandas 2.2.3 pandocfilters 1.5.0
parso 0.8.4 pathspec 0.10.3 patsy 1.0.1
pexpect 4.8.0 pillow 11.1.0 pip 25.0.1
platformdirs 4.3.7 plotly 5.24.1 pluggy 1.5.0
prometheus_client 0.21.1 prompt-toolkit 3.0.43 propcache 0.3.1
proto-plus 1.27.0 protobuf 5.29.4 psutil 5.9.0
psycopg2 2.9.11 ptyprocess 0.7.0 pure-eval 0.2.2
pyarrow 21.0.0 pyasn1 0.4.8 pyasn1-modules 0.2.8
pyccolo 0.0.71 pycparser 2.21 pydantic 2.10.6
pydantic_core 2.27.2 pyflakes 3.2.0 Pygments 2.19.1
PyGObject 3.48.2 pyiceberg 0.10.0 PyJWT 2.10.1
pyodbc 5.2.0 pyparsing 3.2.0 pyright 1.1.394
pyroaring 1.0.3 pytest 8.3.5 python-dateutil 2.9.0.post0
python-dotenv 1.2.1 python-json-logger 3.2.1 python-lsp-jsonrpc 1.1.2
python-lsp-server 1.12.2 pytoolconfig 1.2.6 pytz 2024.1
PyYAML 6.0.2 pyzmq 26.2.0 referencing 0.30.2
regex 2024.11.6 requests 2.32.3 requests-toolbelt 1.0.0
rfc3339-validator 0.1.4 rfc3986-validator 0.1.1 rich 13.9.4
rope 1.13.0 rpds-py 0.22.3 rsa 4.9.1
s3transfer 0.14.0 scikit-learn 1.6.1 scipy 1.15.3
seaborn 0.13.2 Send2Trash 1.8.2 setuptools 78.1.1
shellingham 1.5.4 six 1.17.0 smmap 5.0.0
sniffio 1.3.0 sortedcontainers 2.4.0 soupsieve 2.5
sqlparse 0.5.5 ssh-import-id 5.11 stack-data 0.6.3
starlette 0.50.0 strictyaml 1.7.3 tenacity 9.0.0
terminado 0.17.1 threadpoolctl 3.5.0 tiktoken 0.12.0
tinycss2 1.4.0 tokenize_rt 6.1.0 tokenizers 0.22.2
tomli 2.0.1 tornado 6.5.1 tqdm 4.67.1
traitlets 5.14.3 typeguard 4.3.0 typer-slim 0.21.1
types-python-dateutil 2.9.0.20251115 typing-inspect 0.9.0 typing_extensions 4.12.2
tzdata 2024.1 ujson 5.10.0 unattended-upgrades 0.1
uri-template 1.3.0 urllib3 2.3.0 uuid_utils 0.12.0
uvicorn 0.40.0 virtualenv 20.29.3 wadllib 1.3.6
wcwidth 0.2.5 webcolors 25.10.0 webencodings 0.5.1
websocket-client 1.8.0 whatthepatch 1.0.2 wheel 0.45.1
whenever 0.7.3 widgetsnbextension 3.6.6 wrapt 1.17.0
yapf 0.40.2 yarl 1.18.0 zipp 3.21.0
zstandard 0.23.0

Installed R libraries

R libraries are installed from the Posit Package Manager CRAN snapshot on 2025-11-20.

Library Version Library Version Library Version
arrow 22.0.0 askpass 1.2.1 assertthat 0.2.1
backports 1.5.0 base 4.5.1 base64enc 0.1-3
bigD 0.3.1 bit 4.6.0 bit64 4.6.0-1
bitops 1.0-9 blob 1.2.4 boot 1.3-30
brew 1.0-10 brio 1.1.5 broom 1.0.10
bslib 0.9.0 cachem 1.1.0 callr 3.7.6
caret 7.0-1 cellranger 1.1.0 chron 2.3-62
class 7.3-22 cli 3.6.5 clipr 0.8.0
clock 0.7.3 cluster 2.1.6 codetools 0.2-20
commonmark 2.0.0 compiler 4.5.1 config 0.3.2
conflicted 1.2.0 cpp11 0.5.2 crayon 1.5.3
credentials 2.0.3 curl 7.0.0 data.table 1.17.8
datasets 4.5.1 DBI 1.2.3 dbplyr 2.5.1
desc 1.4.3 devtools 2.4.6 diagram 1.6.5
diffobj 0.3.6 digest 0.6.39 downlit 0.4.5
dplyr 1.1.4 dtplyr 1.3.2 e1071 1.7-16
ellipsis 0.3.2 evaluate 1.0.5 fansi 1.0.7
farver 2.1.2 fastmap 1.2.0 fontawesome 0.5.3
forcats 1.0.1 foreach 1.5.2 foreign 0.8-86
fs 1.6.6 future 1.68.0 future.apply 1.20.0
gargle 1.6.0 generics 0.1.4 gert 2.2.0
ggplot2 4.0.1 gh 1.5.0 git2r 0.36.2
gitcreds 0.1.2 glmnet 4.1-10 globals 0.18.0
glue 1.8.0 googledrive 2.1.2 googlesheets4 1.1.2
gower 1.0.2 graphics 4.5.1 grDevices 4.5.1
grid 4.5.1 gridExtra 2.3 gsubfn 0.7
gt 1.1.0 gtable 0.3.6 hardhat 1.4.2
haven 2.5.5 highr 0.11 hms 1.1.4
htmltools 0.5.8.1 htmlwidgets 1.6.4 httpuv 1.6.16
httr 1.4.7 httr2 1.2.1 ids 1.0.1
ini 0.3.1 ipred 0.9-15 isoband 0.2.7
iterators 1.0.14 jquerylib 0.1.4 jsonlite 2.0.0
juicyjuice 0.1.0 KernSmooth 2.23-22 knitr 1.50
labeling 0.4.3 later 1.4.4 lattice 0.22-5
lava 1.8.2 lifecycle 1.0.4 listenv 0.10.0
litedown 0.8 lubridate 1.9.4 magrittr 2.0.4
markdown 2.0 MASS 7.3-60.0.1 Matrix 1.6-5
memoise 2.0.1 methods 4.5.1 mgcv 1.9-1
mime 0.13 miniUI 0.1.2 mlflow 3.6.0
ModelMetrics 1.2.2.2 modelr 0.1.11 nlme 3.1-164
nnet 7.3-19 numDeriv 2016.8-1.1 openssl 2.3.4
otel 0.2.0 parallel 4.5.1 parallelly 1.45.1
pillar 1.11.1 pkgbuild 1.4.8 pkgconfig 2.0.3
pkgdown 2.2.0 pkgload 1.4.1 plogr 0.2.0
plyr 1.8.9 praise 1.0.0 prettyunits 1.2.0
pROC 1.19.0.1 processx 3.8.6 prodlim 2025.04.28
profvis 0.4.0 progress 1.2.3 progressr 0.18.0
promises 1.5.0 proto 1.0.0 proxy 0.4-27
ps 1.9.1 purrr 1.2.0 R6 2.6.1
ragg 1.5.0 randomForest 4.7-1.2 rappdirs 0.3.3
rcmdcheck 1.4.0 RColorBrewer 1.1-3 Rcpp 1.1.0
RcppEigen 0.3.4.0.2 reactable 0.4.4 reactR 0.6.1
readr 2.1.6 readxl 1.4.5 recipes 1.3.1
rematch 2.0.0 rematch2 2.1.2 remotes 2.5.0
reprex 2.1.1 reshape2 1.4.5 rlang 1.1.6
rmarkdown 2.30 RODBC 1.3-26 roxygen2 7.3.3
rpart 4.1.23 rprojroot 2.1.1 Rserve 1.8-15
RSQLite 2.4.4 rstudioapi 0.17.1 rversions 3.0.0
rvest 1.0.5 S7 0.2.1 sass 0.4.10
scales 1.4.0 selectr 0.4-2 sessioninfo 1.2.3
shape 1.4.6.1 shiny 1.11.1 sourcetools 0.1.7-1
sparklyr 1.9.3 SparkR 4.1.0 sparsevctrs 0.3.4
spatial 7.3-17 splines 4.5.1 sqldf 0.4-11
SQUAREM 2021.1 stats 4.5.1 stats4 4.5.1
stringi 1.8.7 stringr 1.6.0 survival 3.5-8
swagger 5.17.14.1 sys 3.4.3 systemfonts 1.3.1
tcltk 4.5.1 testthat 3.3.0 textshaping 1.0.4
tibble 3.3.0 tidyr 1.3.1 tidyselect 1.2.1
tidyverse 2.0.0 timechange 0.3.0 timeDate 4051.111
tinytex 0.58 tools 4.5.1 tzdb 0.5.0
urlchecker 1.0.1 usethis 3.2.1 utf8 1.2.6
utils 4.5.1 uuid 1.2-1 V8 8.0.1
vctrs 0.6.5 viridisLite 0.4.2 vroom 1.6.6
waldo 0.6.2 whisker 0.4.1 withr 3.0.2
xfun 0.54 xml2 1.5.0 xopen 1.0.1
xtable 1.8-4 yaml 2.3.10 zeallot 0.2.0
zip 2.3.3

Installed Java and Scala libraries (Scala 2.13 cluster version)

Group ID Artifact ID Version
antlr antlr 2.7.7
com.clearspring.analytics stream 2.9.8
com.databricks Rserve 1.8-3
com.databricks databricks-sdk-java 0.53.0
com.databricks jets3t 0.7.1-0
com.databricks.scalapb scalapb-runtime_2.13 0.4.15-11
com.esotericsoftware kryo-shaded 4.0.3
com.esotericsoftware minlog 1.3.0
com.fasterxml classmate 1.5.1
com.fasterxml.jackson.core jackson-annotations 2.18.2
com.fasterxml.jackson.core jackson-core 2.18.2
com.fasterxml.jackson.core jackson-databind 2.18.2
com.fasterxml.jackson.dataformat jackson-dataformat-yaml 2.15.2
com.fasterxml.jackson.datatype jackson-datatype-joda 2.18.2
com.fasterxml.jackson.datatype jackson-datatype-jsr310 2.18.3
com.fasterxml.jackson.module jackson-module-paranamer 2.18.2
com.fasterxml.jackson.module jackson-module-scala_2.13 2.18.2
com.github.ben-manes.caffeine caffeine 2.9.3
com.github.blemale scaffeine_2.13 4.1.0
com.github.fommil jniloader 1.1
com.github.fommil.netlib native_ref-java 1.1
com.github.fommil.netlib native_ref-java 1.1-natives
com.github.fommil.netlib native_system-java 1.1
com.github.fommil.netlib native_system-java 1.1-natives
com.github.fommil.netlib netlib-native_ref-linux-x86_64 1.1-natives
com.github.fommil.netlib netlib-native_system-linux-x86_64 1.1-natives
com.github.luben zstd-jni 1.5.7-6
com.github.virtuald curvesapi 1.08
com.github.wendykierp JTransforms 3.1
com.google.api.grpc proto-google-common-protos 2.5.1
com.google.auth google-auth-library-credentials 1.20.0
com.google.auth google-auth-library-oauth2-http 1.20.0
com.google.auto.value auto-value-annotations 1.10.4
com.google.code.findbugs jsr305 3.0.0
com.google.code.gson gson 2.11.0
com.google.crypto.tink tink 1.16.0
com.google.errorprone error_prone_annotations 2.36.0
com.google.flatbuffers flatbuffers-java 25.2.10
com.google.guava failureaccess 1.0.3
com.google.guava guava 33.4.8-jre
com.google.http-client google-http-client 1.43.3
com.google.http-client google-http-client-gson 1.43.3
com.google.j2objc j2objc-annotations 3.0.0
com.google.protobuf protobuf-java 3.25.5
com.google.protobuf protobuf-java-util 3.25.5
com.helger profiler 1.1.1
com.ibm.icu icu4j 75.1
com.jcraft jsch 0.1.55
com.lihaoyi sourcecode_2.13 0.1.9
com.microsoft.azure azure-data-lake-store-sdk 2.3.10
com.microsoft.sqlserver mssql-jdbc 12.8.0.jre11
com.microsoft.sqlserver mssql-jdbc 12.8.0.jre8
com.ning compress-lzf 1.1.2
com.sun.mail javax.mail 1.5.2
com.sun.xml.bind jaxb-core 2.2.11
com.sun.xml.bind jaxb-impl 2.2.11
com.tdunning json 1.8
com.thoughtworks.paranamer paranamer 2.8
com.trueaccord.lenses lenses_2.13 0.4.13
com.twitter chill-java 0.10.0
com.twitter chill_2.13 0.10.0
com.twitter util-app_2.13 19.8.1
com.twitter util-core_2.13 19.8.1
com.twitter util-function_2.13 19.8.1
com.twitter util-jvm_2.13 19.8.1
com.twitter util-lint_2.13 19.8.1
com.twitter util-registry_2.13 19.8.1
com.twitter util-stats_2.13 19.8.1
com.typesafe config 1.4.3
com.typesafe.scala-logging scala-logging_2.13 3.9.2
com.uber h3 3.7.3
com.univocity univocity-parsers 2.9.1
com.zaxxer HikariCP 4.0.3
com.zaxxer SparseBitSet 1.3
commons-cli commons-cli 1.10.0
commons-codec commons-codec 1.19.0
commons-collections commons-collections 3.2.2
commons-dbcp commons-dbcp 1.4
commons-fileupload commons-fileupload 1.6.0
commons-httpclient commons-httpclient 3.1
commons-io commons-io 2.21.0
commons-lang commons-lang 2.6
commons-logging commons-logging 1.1.3
commons-pool commons-pool 1.5.4
dev.ludovic.netlib arpack 3.0.4
dev.ludovic.netlib blas 3.0.4
dev.ludovic.netlib lapack 3.0.4
info.ganglia.gmetric4j gmetric4j 1.0.10
io.airlift aircompressor 2.0.2
io.delta delta-sharing-client_2.13 1.3.10
io.dropwizard.metrics metrics-annotation 4.2.37
io.dropwizard.metrics metrics-core 4.2.37
io.dropwizard.metrics metrics-graphite 4.2.37
io.dropwizard.metrics metrics-healthchecks 4.2.37
io.dropwizard.metrics metrics-jetty10 4.2.37
io.dropwizard.metrics metrics-jmx 4.2.37
io.dropwizard.metrics metrics-json 4.2.37
io.dropwizard.metrics metrics-jvm 4.2.37
io.dropwizard.metrics metrics-servlets 4.2.37
io.github.java-diff-utils java-diff-utils 4.15
io.netty netty-all 4.2.7.Final
io.netty netty-buffer 4.2.7.Final
io.netty netty-codec 4.2.7.Final
io.netty netty-codec-base 4.2.7.Final
io.netty netty-codec-classes-quic 4.2.7.Final
io.netty netty-codec-compression 4.2.7.Final
io.netty netty-codec-http 4.2.7.Final
io.netty netty-codec-http2 4.2.7.Final
io.netty netty-codec-http3 4.2.7.Final
io.netty netty-codec-marshalling 4.2.7.Final
io.netty netty-codec-native-quic 4.2.7.Final-linux-aarch_64
io.netty netty-codec-native-quic 4.2.7.Final-linux-x86_64
io.netty netty-codec-native-quic 4.2.7.Final-osx-aarch_64
io.netty netty-codec-native-quic 4.2.7.Final-osx-x86_64
io.netty netty-codec-native-quic 4.2.7.Final-windows-x86_64
io.netty netty-codec-protobuf 4.2.7.Final
io.netty netty-codec-socks 4.2.7.Final
io.netty netty-common 4.2.7.Final
io.netty netty-handler 4.2.7.Final
io.netty netty-handler-proxy 4.2.7.Final
io.netty netty-resolver 4.2.7.Final
io.netty netty-tcnative-boringssl-static 2.0.74.Final-db-r0-linux-aarch_64
io.netty netty-tcnative-boringssl-static 2.0.74.Final-db-r0-linux-x86_64
io.netty netty-tcnative-boringssl-static 2.0.74.Final-db-r0-osx-aarch_64
io.netty netty-tcnative-boringssl-static 2.0.74.Final-db-r0-osx-x86_64
io.netty netty-tcnative-boringssl-static 2.0.74.Final-db-r0-windows-x86_64
io.netty netty-tcnative-classes 2.0.74.Final
io.netty netty-transport 4.2.7.Final
io.netty netty-transport-classes-epoll 4.2.7.Final
io.netty netty-transport-classes-io_uring 4.2.7.Final
io.netty netty-transport-classes-kqueue 4.2.7.Final
io.netty netty-transport-native-epoll 4.2.7.Final
io.netty netty-transport-native-epoll 4.2.7.Final-linux-aarch_64
io.netty netty-transport-native-epoll 4.2.7.Final-linux-riscv64
io.netty netty-transport-native-epoll 4.2.7.Final-linux-x86_64
io.netty netty-transport-native-io_uring 4.2.7.Final-linux-aarch_64
io.netty netty-transport-native-io_uring 4.2.7.Final-linux-riscv64
io.netty netty-transport-native-io_uring 4.2.7.Final-linux-x86_64
io.netty netty-transport-native-kqueue 4.2.7.Final-osx-aarch_64
io.netty netty-transport-native-kqueue 4.2.7.Final-osx-x86_64
io.netty netty-transport-native-unix-common 4.2.7.Final
io.opencensus opencensus-api 0.31.1
io.opencensus opencensus-contrib-http-util 0.31.1
io.prometheus simpleclient 0.16.1-databricks
io.prometheus simpleclient_common 0.16.1-databricks
io.prometheus simpleclient_dropwizard 0.16.1-databricks
io.prometheus simpleclient_pushgateway 0.16.1-databricks
io.prometheus simpleclient_servlet 0.16.1-databricks
io.prometheus simpleclient_servlet_common 0.16.1-databricks
io.prometheus simpleclient_tracer_common 0.16.1-databricks
io.prometheus simpleclient_tracer_otel 0.16.1-databricks
io.prometheus simpleclient_tracer_otel_agent 0.16.1-databricks
io.prometheus.jmx collector 0.18.0
jakarta.annotation jakarta.annotation-api 1.3.5
jakarta.servlet jakarta.servlet-api 4.0.3
jakarta.validation jakarta.validation-api 2.0.2
jakarta.ws.rs jakarta.ws.rs-api 2.1.6
javax.activation activation 1.1.1
javax.annotation javax.annotation-api 1.3.2
javax.el javax.el-api 2.2.4
javax.jdo jdo-api 3.0.1
javax.media jai_core jai_core_dummy
javax.transaction jta 1.1
javax.transaction transaction-api 1.1
javax.xml.bind jaxb-api 2.2.11
javolution javolution 5.5.1
jline jline 2.14.6
joda-time joda-time 2.14.0
net.java.dev.jna jna 5.8.0
net.razorvine pickle 1.5
net.sf.jpam jpam 1.1
net.sf.opencsv opencsv 2.3
net.sf.supercsv super-csv 2.2.0
net.snowflake snowflake-ingest-sdk 0.9.6
net.sourceforge.f2j arpack_combined_all 0.1
org.acplt.remotetea remotetea-oncrpc 1.1.2
org.antlr ST4 4.0.4
org.antlr antlr-runtime 3.5.2
org.antlr antlr4-runtime 4.13.1
org.antlr stringtemplate 3.2.1
org.apache.ant ant 1.10.11
org.apache.ant ant-jsch 1.10.11
org.apache.ant ant-launcher 1.10.11
org.apache.arrow arrow-compression 18.3.0
org.apache.arrow arrow-format 18.3.0
org.apache.arrow arrow-memory-core 18.3.0
org.apache.arrow arrow-memory-netty 18.3.0
org.apache.arrow arrow-memory-netty-buffer-patch 18.3.0
org.apache.arrow arrow-vector 18.3.0
org.apache.avro avro 1.12.1
org.apache.avro avro-ipc 1.12.1
org.apache.avro avro-mapred 1.12.1
org.apache.commons commons-collections4 4.5.0
org.apache.commons commons-compress 1.28.0
org.apache.commons commons-configuration2 2.11.0
org.apache.commons commons-crypto 1.1.0
org.apache.commons commons-lang3 3.19.0
org.apache.commons commons-math3 3.6.1
org.apache.commons commons-text 1.14.0
org.apache.curator curator-client 5.9.0
org.apache.curator curator-framework 5.9.0
org.apache.curator curator-recipes 5.9.0
org.apache.datasketches datasketches-java 6.2.0
org.apache.datasketches datasketches-memory 3.0.2
org.apache.derby derby 10.14.2.0
org.apache.hadoop hadoop-client-runtime 3.4.2
org.apache.hive hive-beeline 2.3.10
org.apache.hive hive-cli 2.3.10
org.apache.hive hive-jdbc 2.3.10
org.apache.hive hive-llap-client 2.3.10
org.apache.hive hive-llap-common 2.3.10
org.apache.hive hive-serde 2.3.10
org.apache.hive hive-shims 2.3.10
org.apache.hive hive-storage-api 2.8.1
org.apache.hive.shims hive-shims-0.23 2.3.10
org.apache.hive.shims hive-shims-common 2.3.10
org.apache.hive.shims hive-shims-scheduler 2.3.10
org.apache.httpcomponents httpclient 4.5.14
org.apache.httpcomponents httpcore 4.4.16
org.apache.ivy ivy 2.5.3
org.apache.logging.log4j log4j-1.2-api 2.24.3
org.apache.logging.log4j log4j-api 2.24.3
org.apache.logging.log4j log4j-core 2.24.3
org.apache.logging.log4j log4j-layout-template-json 2.24.3
org.apache.logging.log4j log4j-slf4j2-impl 2.24.3
org.apache.orc orc-core 2.2.0-shaded-protobuf
org.apache.orc orc-format 1.1.1-shaded-protobuf
org.apache.orc orc-mapreduce 2.2.0-shaded-protobuf
org.apache.orc orc-shims 2.2.0
org.apache.poi poi 5.4.1
org.apache.poi poi-ooxml 5.4.1
org.apache.poi poi-ooxml-full 5.4.1
org.apache.poi poi-ooxml-lite 5.4.1
org.apache.thrift libfb303 0.9.3
org.apache.thrift libthrift 0.16.0
org.apache.ws.xmlschema xmlschema-core 2.3.1
org.apache.xbean xbean-asm9-shaded 4.28
org.apache.xmlbeans xmlbeans 5.3.0
org.apache.yetus audience-annotations 0.13.0
org.apache.zookeeper zookeeper 3.9.4
org.apache.zookeeper zookeeper-jute 3.9.4
org.checkerframework checker-qual 3.43.0
org.codehaus.janino commons-compiler 3.0.16
org.codehaus.janino janino 3.0.16
org.datanucleus datanucleus-api-jdo 4.2.4
org.datanucleus datanucleus-core 4.1.17
org.datanucleus datanucleus-rdbms 4.1.19
org.datanucleus javax.jdo 3.2.0-m3
org.eclipse.jetty jetty-alpn-client 10.0.26
org.eclipse.jetty jetty-client 10.0.26
org.eclipse.jetty jetty-http 10.0.26
org.eclipse.jetty jetty-io 10.0.26
org.eclipse.jetty jetty-jndi 10.0.26
org.eclipse.jetty jetty-plus 10.0.26
org.eclipse.jetty jetty-proxy 10.0.26
org.eclipse.jetty jetty-security 10.0.26
org.eclipse.jetty jetty-server 10.0.26
org.eclipse.jetty jetty-servlet 10.0.26
org.eclipse.jetty jetty-servlets 10.0.26
org.eclipse.jetty jetty-util 10.0.26
org.eclipse.jetty jetty-webapp 10.0.26
org.eclipse.jetty jetty-xml 10.0.26
org.fusesource.leveldbjni leveldbjni-all 1.8
org.glassfish.hk2 hk2-api 2.6.1
org.glassfish.hk2 hk2-locator 2.6.1
org.glassfish.hk2 hk2-utils 2.6.1
org.glassfish.hk2 osgi-resource-locator 1.0.3
org.glassfish.hk2.external aopalliance-repackaged 2.6.1
org.glassfish.hk2.external jakarta.inject 2.6.1
org.glassfish.jersey.containers jersey-container-servlet 2.41
org.glassfish.jersey.containers jersey-container-servlet-core 2.41
org.glassfish.jersey.core jersey-client 2.41
org.glassfish.jersey.core jersey-common 2.41
org.glassfish.jersey.core jersey-server 2.41
org.glassfish.jersey.inject jersey-hk2 2.41
org.hibernate.validator hibernate-validator 6.2.5.Final
org.ini4j ini4j 0.5.4
org.javassist javassist 3.29.2-GA
org.jboss.logging jboss-logging 3.4.1.Final
org.jdbi jdbi 2.63.1
org.jline jline 3.27.1-jdk8
org.joda joda-convert 1.7
org.json4s json4s-ast_2.13 4.0.7
org.json4s json4s-core_2.13 4.0.7
org.json4s json4s-jackson-core_2.13 4.0.7
org.json4s json4s-jackson_2.13 4.0.7
org.json4s json4s-scalap_2.13 4.0.7
org.locationtech.jts jts-core 1.20.0
org.lz4 lz4-java 1.8.0-databricks-1
org.mlflow mlflow-spark_2.13 2.22.1
org.objenesis objenesis 3.4
org.postgresql postgresql 42.6.1
org.roaringbitmap RoaringBitmap 1.2.1
org.rosuda.REngine REngine 2.1.0
org.scala-lang scala-compiler_2.13 2.13.16
org.scala-lang scala-library_2.13 2.13.16
org.scala-lang scala-reflect_2.13 2.13.16
org.scala-lang.modules scala-collection-compat_2.13 2.11.0
org.scala-lang.modules scala-java8-compat_2.13 0.9.1
org.scala-lang.modules scala-parallel-collections_2.13 1.2.0
org.scala-lang.modules scala-parser-combinators_2.13 2.4.0
org.scala-lang.modules scala-xml_2.13 2.4.0
org.scala-sbt test-interface 1.0
org.scalacheck scalacheck_2.13 1.18.0
org.scalactic scalactic_2.13 3.2.19
org.scalanlp breeze-macros_2.13 2.1.0
org.scalanlp breeze_2.13 2.1.0
org.scalatest scalatest-compatible 3.2.19
org.scalatest scalatest-core_2.13 3.2.19
org.scalatest scalatest-diagrams_2.13 3.2.19
org.scalatest scalatest-featurespec_2.13 3.2.19
org.scalatest scalatest-flatspec_2.13 3.2.19
org.scalatest scalatest-freespec_2.13 3.2.19
org.scalatest scalatest-funspec_2.13 3.2.19
org.scalatest scalatest-funsuite_2.13 3.2.19
org.scalatest scalatest-matchers-core_2.13 3.2.19
org.scalatest scalatest-mustmatchers_2.13 3.2.19
org.scalatest scalatest-propspec_2.13 3.2.19
org.scalatest scalatest-refspec_2.13 3.2.19
org.scalatest scalatest-shouldmatchers_2.13 3.2.19
org.scalatest scalatest-wordspec_2.13 3.2.19
org.scalatest scalatest_2.13 3.2.19
org.slf4j jcl-over-slf4j 2.0.16
org.slf4j jul-to-slf4j 2.0.16
org.slf4j slf4j-api 2.0.16
org.slf4j slf4j-simple 1.7.25
org.threeten threeten-extra 1.8.0
org.tukaani xz 1.10
org.typelevel algebra_2.13 2.8.0
org.typelevel cats-kernel_2.13 2.8.0
org.typelevel spire-macros_2.13 0.18.0
org.typelevel spire-platform_2.13 0.18.0
org.typelevel spire-util_2.13 0.18.0
org.typelevel spire_2.13 0.18.0
org.wildfly.openssl wildfly-openssl 1.1.3.Final
org.xerial sqlite-jdbc 3.42.0.0
org.xerial.snappy snappy-java 1.1.10.3
org.yaml snakeyaml 2.0
oro oro 2.0.8
pl.edu.icm JLargeArrays 1.5
software.amazon.cryptools AmazonCorrettoCryptoProvider 2.5.0-linux-x86_64
stax stax-api 1.0.1