Description
Integrates Apache Hudi with Spark 3.5 and Scala 2.13 through runtime bundles and utility JARs. It is useful for data engineers building lakehouse, table-format, or streaming data workflows on JVM-based clusters.
Data platform libraries can read, write, and reorganize large datasets. Validate schemas, credentials, storage paths, retention rules, and rollback plans before using them with production data.