Improvements in Release 2.5 of the Splice Machine Database

This topic describes significant improvements in the Splice Machine Database, which is used in both our Database-as-Service and On-Premise Database products with notes for each patch release since the initial release of version 2.5, in these sections:

2.5.0.1805 Patch Release 5-Feb-18

JIRA-ID Description
SPLICE-2022 spark job description for compaction

2.5.0.1804 Patch Release 28-Jan-18

JIRA-ID Description
No improvements in this patch release.

2.5.0.1803 Patch Release 21-Jan-18

JIRA-ID Description
SPLICE-1987 Add fully qualified table name into HBASE TableDisplayName Attribute

2.5.0.1802 Patch Release 14-Jan-18

JIRA-ID Description
SPLICE-1975 Allow JavaRDD<Row> to be passed for CRUD operations in SplicemachineContext
SPLICE-1991 Add info to Spark UI for Compaction jobs to indicate presence of Reference Files

2.5.0.1749 Patch Release 26-Dec-17

JIRA-ID Description
SPLICE-1973 Exclude Kafka jars from splice-uber.jar for all platforms
SPLICE-1984 Parallelize MultiProbeTableScan and Union Operations

2.5.0.1748 Patch Release 18-Dec-17

JIRA-ID Description
No improvements in this patch release.

2.5.0.1747 Patch Release 08-Dec-17

JIRA-ID Description
SPLICE-1351 Upgrade Sketching Library from 0.8.1 - 0.8.4
SPLICE-1948 Increase test timeout
SPLICE-1948 Initialize Splice Spark context with user context
SPLICE-1951 Remove protobuf installation instructions

2.6.1.1745 Patch Release 08-Dec-17

2.6 was an interim release that has been folded in to 2.5

JIRA-ID Description
SPLICE-1302 Add minimum parallelism for Spark shuffles
SPLICE-1951 Remove protobuf installation instructions from
SPLICE-1948 Initialize Splice Spark context with user context
SPLICE-1948 Increase test timeout

Splice Patch Release 2.5.0.1735

JIRA ID Description
SPLICE-398 Support drop view if exists
SPLICE-949 Built-in function ROUND added
SPLICE-1222 Implement in-memory subtransactions
SPLICE-1351 Upgrade Sketching Library from 0.8.1 - 0.8.4
SPLICE-1372 Control-side query control
SPLICE-1479 Iterator based stats collection
SPLICE-1497 Add flag for inserts to skip conflict detection
SPLICE-1500 Skip WAL for unsafe imports
SPLICE-1513 Create Spark Adapter that supports both 1.6.x and 2.1.0 versions of Spark
SPLICE-1516 Enable compression for WritePipeline
SPLICE-1555 Enable optimizer trace info for costing
SPLICE-1568 Core Spark Adapter Functionality With Maven Build
SPLICE-1619 Update the Spark Adapter to 2.1.1
SPLICE-1681 Introduce query hint "skipStats" after a table identifier to bypass fetching real stats from dictionary tables
SPLICE-1702 Removed LocatedRow construct from the execution tree
SPLICE-1703 Changed size==0 to isEmpty()
SPLICE-1714 Ignore "should not give a splitkey that equates to startkey" exception
SPLICE-1725 External table documentation updated
SPLICE-1729 Handle 'drop table table_name if exists'
SPLICE-1733 Support type conversion Varchar to INT
SPLICE-1739 Added CREATE SCHEMA IF NOT EXISTS functionality
SPLICE-1752 Support inserting int types to char types
SPLICE-1756 Introduce database property collectIndexStatsOnly to specify the collect stats behavior
SPLICE-1760 Enhancement to provide corresponding Spark JobID when Splice jobs or queries are submitted through Spark
SPLICE-1785 Too many tasks are launched in the last stage of bulk import
SPLICE-1834 Remove EFS FileSystem
SPLICE-1835 Remove MBeanResultSet
SPLICE-1836 Remove SpliceCsvTokenizer
SPLICE-1837 Remove Old Cost Estimate Implementation.
SPLICE-1838 Remove Left Over Aggregate Plumbing
SPLICE-1839 Remove Serial Encoding Package
SPLICE-1840 Remove Dead PhysicalStatsStore
SPLICE-1841 Remove ScanInfo class and Interfaces
SPLICE-1842 Derby Utils Dead Code Cleanup
SPLICE-1845 Tweak Kryo Serde for Missing Elements
SPLICE-1851 Remove concurrent.traffic package
SPLICE-1873 Added documentation for GET_SESSION_INFO
SPLICE-1875 Added documentation for GET_RUNNING_OPERATIONS and KILL_OPERATION
SPLICE-1879 KeyBy Function on Control is a multimap index vs. a map function
SPLICE-1880 Modify ReduceByKey to execute lazily and not use Multimaps.
Internal Added SplicemachineContext.g
Internal Bcast implementation dataset vs rddetConnection() to enable commit/rollback in Scala
Internal Bcast implementation dataset vs rdd
Internal Add logging to Vacuum process

Splice Patch Release 2.5.0.1729 01-Aug-17

JIRA ID Description
SPLICE-398 Support 'drop view if exists' for 2.5
SPLICE-774 Support upgrade from K2 (2.5)
SPLICE-1479 iterator based stats collection (2.5)
SPLICE-1516 Enable compression for WritePipeline
SPLICE-1500 Skip WAL for unsafe imports (2.5)
SPLICE-1555 Enable optimizer trace info for costing
SPLICE-1681 Introduce query hint "skipStats"
SPLICE-1701 JXM mbean server for cache and enginedriver exec service
SPLICE-1729 Support 'drop table t_name if exists' for 2.5
SPLICE-1756 introduce database property collectIndexStatsOnly to specify the collect stats behavior
SPLICE-1769 Improve distributed boot process
Internal Bcast implementation dataset vs rdd
Internal prune query blocks based on unsatisfiable conditions
Internal Add logging to Vacuum process
Internal Restore breaks transaction semantics(2.5)

2.5 GA Release 01-Mar-17

JIRA ID Description
Internal Performance enhancement on TPCC, TPCH, and Backup/Restore.
Internal Statistics enhancements, including histograms.

For a full list of JIRA’s for the Community/Open Source software, see https://splice.atlassian.net