Updating on-premise JDBC extractors

Upcoming Java 25 Requirement for OPCs and OPEs

Starting April 2026, upgrading to the latest On-Premise Client (OPC) or On-Premise Extractor (OPE) will require Java 25. Current installations will remain unaffected (non-breaking change) unless you choose to update to the newest OPC or OPE versions. We will provide finalized Java requirements once the release versions are confirmed.

While no immediate action is required, we recommend planning your Java environment updates now to ensure a smooth transition.

The JDBC extractor, available from the Celonis Platform Download Portal, allows you to connect to any SQL database. This page provides instructions and information on updating existing on-premise JDBC extractor installations.

Updating an on-premises JDBC Extractor follows a side-by-side upgrade methodology. This approach involves installing the new version into a separate, clean directory rather than overwriting your existing installation. By maintaining the previous version's directory intact, you ensure a seamless migration of your custom configurations while providing a reliable fallback point should you need to revert to the previous state.

Note

Before you start the upgrade, ensure your server meets Prerequisites for on-premise JDBC Extractors.

To update an on-premise JDBC extractor:

If you've customized your JDBC extractor's configuration files, save a copy of these files from your existing extractor directory.
Important
Failing to perform this step could result in permanent data loss if you overwrite your content with the default files included in the extractor package.
- YAML configuration: application-local.yml
- Proxy configuration: proxy.yml
- XML configuration: CelonisJdbcExtractor.xml
In a sandbox environment, create a new directory, and install the new version of the JDBC Extractor package into it.
Important
When the package runs, it creates subdirectories and extracts the files from the included JAR file. It's critical use this this in a new directory (side-by-side approach), rather than replacing the JAR file in your previous version's directory, so that the new package can correctly validate the installation.
If updating from version 2.77.0 (2023-02-27) or earlier: You must use the updated version of CelonisJdbcExtractor.xml included with the new extractor package. To do so, transfer any customizations from your copy of CelonisJdbcExtractor.xml to the CelonisJdbcExtractor.xml file supplied with the new version.
If updating from version 2.82.0 (2023-08-10) or later: Replace the CelonisJdbcExtractor.xml file in your new directory with your customized version.
Replace the following files in your new directory with your customized versions:
- YAML configuration: application-local.yml
- Proxy configuration: proxy.yml
Note
You can safely overwrite the default versions of these files included in the new extractor package.
Start the new version of the extractor following the instructions in "Step D: Run The Extractor" in How do I set up an on-premise Extractor?
Verify the extractor is working correctly in your sandbox environment. If it is, follow the above steps to install the extractor to your production environment.
Once the updated version of the extractor is working in your production environment, stop and uninstall the older version of the extractor.

JDBC Extractor change history

Version	Release date	Changes
4.24.0	2026-02-19	Added - Default rolling logs cleanup policy. See: (Recommended) Managing JDBC Extractor log files. Added - Improved logging for proxy configuration to distinguish between proxy and internal-proxy. Fixed - Hanging of subsequent extractions by improved handling of active extractions.
4.23.0	2026-02-04	Fixed - Updated internal dependencies not impacting users directly.
4.19.0	2025-12-11	Fixed - MSSQL Increase Filter DateTime Precision using Datetime2.
4.17.0	2025-12-03	Added - Support for BigInteger values from Impala driver.
4.16.0	2025-11-20	Fixed - Updated dependencies (further details available in the Download Portal).
4.15.2	2025-11-13	Fixed - Extraction preview without primary keys. Fixed - Log discrepancy between local extractor and Celonis Platform. Fixed - Row ID column extraction for JDBC Extraction Editor.
4.9.0	2025-10-10	Added - Use Default value of dynamic parameter while running Full extraction created via SQL Editor. Added - Notification in the logs about outdated on-prem extractor version. Fixed - Replication Cockpit issue when changes happened during initialization might be ignored. Fixed - Resolution format for DATE type using a valid syntax for IBM DB2 AS400.
4.8.2	2025-09-24	Added - Cloudera Impala Driver for extractions using Direct JDBC Connection
4.7.3	2025-09-01	Added - Amazon Redshift: External tables now appear when using Driver Metadata as metadata source
4.6.1	2025-08-01	Improved messages during connection issues. Increased default connection timeout to 60 seconds. Databricks Use Catalog Statement, now uses quoted name. Windows On Premise extractor installation no longer requires winutils.exe Added - Security check for Connection parameters. Added - Pseudonymize FLOAT columns. Added - Uplinked connections for domains containing the term 'integration' are recognized.
4.4.9	2025-07-18	Improved messages during connection issues.
4.4.5	2025-07-09	Added - Support for the OAuth2-based authentication and authorization to the dockerized version of the On-Prem JDBC (Database) Extractor. See: OAuth authentication in Docker Added - Detailed logging for Replication Cockpit.
4.3.0	2025-06-26	Fixed - Database connection test for Trino. Updated third-party dependencies.
4.2.3	2025-06-12	Fixed - Increased SQL editor timeout preview. Fixed - Fully qualified column references supported during duplicate removal in visual mode. Fixed - Stored Oracle timestamp precision to 6 digits. Fixed - Timestamp precision used in extraction filter queries to 6 digits. Applies to Oracle, Postgres, IBM-DB2, Athena, MSSQL, SAP-HANA, BigQuery. Fixed - SQL editor preview for IBM DB2 AS400 database.
4.1.3	2025-05-19	Added - Extractions Editor and AI Assistant general availability release. Fixed - Snowflake connection host field validation. Fixed - Improved observability for Query execution metrics. Fixed - Improve error message for binary columns extraction for Snowflake Bulk mode. Fixed - Delta load issue due to timestamp precision for Snowflake. Fixed - Missing specified schema in data connection during querying tables for Databricks.
3.8.3	2025-04-17	Added - Add Parameter selection box on Extractions SQL Editor. Fixed - Updated JDBC driver for MySQL to v9.2.0. Fixed - Updated JDBC driver Snowflake to v3.23.2 Fixed - Broken log generation in JDBC on-prem extractor Docker image. Fixed - Templates functionality on JDBC extraction configuration.
3.7.1	2025-03-27	Added - Increase precision for timestamp to be 6 digits instead of 3 for Databricks extractions. Added - JDBC extractor for NetSuite. Fixed - Support spaces in column names when ordering by primary keys during extraction configuration in visual mode. Fixed - Convert numeric values to string without trailing zeros after the fractional part.
3.6.0	2025-03-03	Requires Java 21 From JDBC extractor version 3.6.0 onward, you must update to Java 21 before installing the JDBC extractor update. Added: Create blank/no-row tables when there is no data in source-system.
3.5.0	2025-02-06	Fixed - Include milliseconds precision for timestamp columns for delta extraction in Databricks connector. Fixed - issue with initialization of Replication Cockpit that affected transformations.
3.4.1	2025-01-15	Added VarChar Optimization v2 support in the Replication Cockpit. Fixed - Extractor JAR file in the on premise package is now signed for the authenticity. Fixed - Binary datatype handling is correctly supported for Snowflake bulk extractions.
3.3.0	2024-12-13	Added JOIN support for Snowflake bulk extractions. Improvements in memory consumption during parquet files uploads.
3.2.0	2024-12-10	Limited availability release of VARCHAR optimization version 2 for on-prem extractors. Limited availability release of Snowflake bulk export feature for JDBC full extractions on direct connections.
3.1.1	2024-11-27	Upgraded MSSQL JDBC driver to 12.8.1. Improved messaging for Snowflake connection test if private key is stored outside of Extractor directory.
3.0.0	2024-10-30	Limited availability release of the Extractions Editor and AI Assistant, allowing to use your source system specific SQL functions and JOINS when connecting to your source system using the JDBC extractor. For more, see: Creating extraction tasks using the Extractions Editor and AI Assistant. Fixed filter based on date columns using Teradata database.
2.106.2	2024-10-16	Added UI indicator when there is a filter for Join Config and removed warning related to the filter. Added support for EXTERNAL table type in Athena.
2.105.0	2024-09-30	Internal improvements.
2.104.1	2024-09-05	Upgraded the Amazon Athena driver to version 2.1.5.1000.
2.103.0	2024-08-23	Fixed preview and limit for Impala connector. Upgraded the IBM DB2 driver. Fixed veracode security issue related to crypto algorithm used by Vault. The existing vault.jar will not work from JDBC v2.103.0. Instead, upgrade to the latest version of the extractor for full functionality. Fixed Extension Provider query for target tables with special characters. Fixed vulnerabilities in JDBC Connection String. Upgraded the Oracle driver.
2.102.1	2024-08-08	Added a readme file listing the libraries that the extractor uses. Made the extractor version naming on the uplink integrations page consistent with the download portal. Fixed some security vulnerabilities.
2.101.5	2024-07-23	Improved the date selection for downloading on-premise extractor logs. Upgraded various third-party libraries to their latest version. We now prevent the extractor starting if your proxy configuration is invalid.
2.100.0	2024-07-05	You can now search BigQuery tables using one or more projects from the search bar. Upgraded MSSQL driver to the latest (12.6.3.jre11) and related msal4j dependency to 1.15.1. Fixed a bug where on-premise extractor failed to shut down immediately after closing it. Fixed a bug where on-premise extractor failed to report issues with broken proxy configuration at start up.
2.99.0	2024-06-19	We've added the capability to increase parallel executions up to 40 through local configurations. For more information, see: Configuring an on-premise extractor. Upgrade the Google BigQuery driver to version 1.5.4.1008. Fixed struct columns parsing in Oracle tables after Oracle JDBC driver upgrade.
2.98.5	2024-06-10	Added support for materialized views and snapshots in BigQuery. Retry file upload on receiving HTTP status code 408 to overcome temporary CloudFlare issue. Optimized the way file upload works to improve performance and avoid connection timeout issues. Upgraded Snowflake JDBC driver to version 3.16.1 to fix issues with nested paths on Windows machines. Upgraded several third party libraries to fix vulnerabilities.
2.97.1	2024-05-29	For Oracle extractions, you can now configure type casting between Integer and Float.
2.96.1	2024-05-14	If you’re extracting data from a Google BigQuery database, you can now get data from external tables, which reference data stored outside BigQuery, as well as from standard BigQuery tables. Fixed an issue where a schedule consisting of an uplinked job was conflicting with a direct job.
2.95.2	2024-05-03	For filters, fixed an additional comma (,) that was being added to an IN clause. Fixed an issue with table hash keys colliding for metadata caching, which skipped the extraction of one of the tables.
2.94.0	2024-04-08	Delta loads strictly default to canceling the job if the metadata from your source system changes, to avoid data inconsistency (Option A). Do a full load if this happens. Fixed a bug where MAX_STRING_LENGTH was not getting applied to a column during the table configuration.
2.93.2	2024-03-25	Added caching for InformationSchema-based and SampleQuery-based retrieved metadata. Updated third-party dependencies.
2.92.1	2024-03-06	Connect directly to Oracle EBS as a Cloud Connection - no need to use an uplink. The extractor now shows tables only from the schema specified in the Connection Configuration. MS SQL extractions for tables with clustered column index now work when the metadata source is SAMPLE_QUERY.
2.91.5	2024-02-16	Added support for Optimizer hints while extracting data from Oracle databases. The Windows installer executable is now signed with a Celonis certificate. Unified the three metadata retrieval approaches so that they all return consistent information. Upgraded to newer versions of the logback and json libraries.
2.90.0	2024-01-26	Fixed Databricks extraction issues when using the default catalog and default database. Fixed some security vulnerabilities.
2.89.0	2024-01-11	For driver metadata, fixed Microsoft SQL server extraction issue when table has clustered columnstore index. Optimized memory allocations during extractions.
2.88.1	2023-12-13	Beta release of Oracle Smart Extraction, which parallelizes extractions of larger Oracle tables to reduce data extraction times. The feature is shipped disabled. If you want to try it out, we recommend that you do so in a sandbox environment. To get Oracle Smart Extraction enabled, talk to your Celonis point of contact or create a support ticket. Fix for an SAP HANA filter parser error when concatenation is used.
2.87.0	2023-10-26	Fixed deviations for dates earlier than 1900 due to timezone changes. Fixed an issue with the Snowflake driver for a new installation on Microsoft Windows. SQL ID will now be logged for Oracle if debug mode is enabled.
2.86.0	2023-09-22	Enabled STRING to DATETIME conversion for BigQuery and Trino. For Oracle, we’ve improved the query for the INFORMATION_SCHEMA metadata source. Quotes in the filter statement are now recognized. Fixed some security vulnerabilities. Fixed an issue for the custom BigQuery driver where classes were not loaded in the correct order. Fixed an issue for uplinked extractors using a proxy configuration.
2.85.0 (2023-09-01)	2023-09-01	Upgraded the JDBC extractor’s internal libraries. If you’re linking the BigQuery driver, you’ll need to exclude all SLF4J .jar files from the driver package. On Microsoft Windows, we’ve changed the JDBC extractor’s dependency from the Microsoft Visual C++ 2010 Redistributable Package to the Microsoft Visual C++ 2015-2019 Redistributable Package (x64). Install that package when you install this version of the JDBC extractor. Wildcards in Snowflake metadata calls are now escaped to improve load.
2.84.0	2023-08-10	Added support for Oracle CLOB (Character Large Object) and NCLOB (National Character Large Object) data types.
2.83.0	2023-06-21	Extractions that hang can be resumed from the last table, instead of restarting them. Upgraded the driver for Snowflake to version 3.13.33. Upgraded the driver for Athena to version 2.0.36. Upgraded the driver for IBM DB2 to version 11.5. Implemented TO_DATE functionality for Oracle filters.
2.82.0 and 2.80.1	2023-08-10	Upgraded the JDBC extractor to Java 17. From version 2.80.1, you're required to upgrade your Java version to at least Java 17 to ensure compatibility and leverage the latest enhancements and security features. With version 2.82.0, we’ve removed some additional steps from the upgrade process, so use this or a later version of the extractor package.
2.77.0	2023-02-27	Upgraded MSSQL-JDBC Driver to latest version. Set `trustServerCertificate=true` and `encrypt=false` by default in case they are not set in the additional properties field (required by driver upgrade).
2.76.0	2023-02-15	Upgraded MySQL driver to latest version.
2.75.0	2023-02-01	Fixed security vulnerabilities.
2.71.0	2022-11-25	Added support for extractions from Databricks. Oracle DB: Fix for scenarios where the driver metadata was used, even if Information Schema is selected. Improved clean-up of changelog tables for real-time extractions by doing the clean-up in chunks.
2.70.0	2022-11-16	Improved metadata query for Oracle databases.
2.69.0	2022-10-27	Added support for Analytical Views for SAP HANA. Fixed the feature to clear the metadata cache.
2.67.0	2022-09-28	Added support for Analytical Views for SAP HANA.
2.66.0	2022-09-15	Fixed Java-based vulnerabilities.
2.65.0	2022-09-09	Added support for key pair authentication for Snowflake.
2.64.0	2022-08-24	Extraction of synonyms for Oracle databases.
2.63.0	2022-06-30	Extended the logging messages.
2.62.0	2022-06-30	Enabled Materialized view for the Postgres database. Added support for Attribute(Joined) views for SAP HANA.
2.61.0	2022-06-30	BigQuery Get Tables from Additional Projects. BigQuery ADC authentication (hosted in GCP). Fixed test connection issue for BigQuery. Fixed input box for BigQuery data connection form.
2.60.0	2022-06-16	Minor improvements and fixes.
2.59.0	2022-06-02	Added SHA-256 and SHA-512 support. Fixed the order of delete and insert executed at the same time.
2.57.0	2022-05-05	For Google BigQuery, fixed duplicate records caused by a LIMIT/OFFSET in SELECT queries without ORDER BY (primary key) clause not guaranteeing proper pagination. Added validation to inform the user with an error message if a primary key is not selected. Fixed NullPointerException in uploading results leading to duplicated push jobs. Extended invalidate cache also for real-time integration column selection.
2.56.0	2022-05-05	Added initialization for JDBC real-time via Replication Cockpit. Fixed out of memory error due to unlimited threads. Fixed column order changes after deselecting some of the columns in JDBC.
2.55.0	2022-05-05	Added a database connection timeout setting in the UI. This overwrites the local timeout in application-local.yml in case of uplinked connections.
2.54.0	2022-05-05	Added an authentication option SERVICE_ACCOUNT_AUTHENTICATION for Google BigQuery database connection. Inputs are the service authentication account email ID and the service account key file. Improved logging for JDBC extraction in DEBUG extraction mode. Added support for Vertica database type. Performance improvements for JDBC data extractions on the upload mechanism.
2.51.0	2022-05-05	Removed the option to include a changelog time stamp in JDBC real-time extractions, and made it the default, to support real-time transformations.
2.50.0	2022-05-05	Extended duplicate removal for all database types by adding properties in application-local.yml for uplink database connections. To enable this, add the following to the application-local.yml file: duplicate-removal: enabled: true strategy: CLOUD
2.49.0	2022-05-05	Logical change in reading change log tables in the JDBC real-time scenario to improve performance. Fixed change in metadata source not being consistently reflected in the metadata query.

Updating on-premise JDBC extractors

JDBC Extractor change history

Search results