Skip to content

Commit 0511a31

Browse files
authored
Merge pull request #675 from AbsaOSS/release/2.7.0
Release Cobrix v2.7.0
2 parents bf57584 + 0c4dea0 commit 0511a31

File tree

11 files changed

+52
-25
lines changed

11 files changed

+52
-25
lines changed

README.md

Lines changed: 42 additions & 15 deletions
Original file line numberDiff line numberDiff line change
@@ -74,13 +74,13 @@ You can link against this library in your program at the following coordinates:
7474
</tr>
7575
<tr>
7676
<td>
77-
<pre>groupId: za.co.absa.cobrix<br>artifactId: spark-cobol_2.11<br>version: 2.6.11</pre>
77+
<pre>groupId: za.co.absa.cobrix<br>artifactId: spark-cobol_2.11<br>version: 2.7.0</pre>
7878
</td>
7979
<td>
80-
<pre>groupId: za.co.absa.cobrix<br>artifactId: spark-cobol_2.12<br>version: 2.6.11</pre>
80+
<pre>groupId: za.co.absa.cobrix<br>artifactId: spark-cobol_2.12<br>version: 2.7.0</pre>
8181
</td>
8282
<td>
83-
<pre>groupId: za.co.absa.cobrix<br>artifactId: spark-cobol_2.13<br>version: 2.6.11</pre>
83+
<pre>groupId: za.co.absa.cobrix<br>artifactId: spark-cobol_2.13<br>version: 2.7.0</pre>
8484
</td>
8585
</tr>
8686
</table>
@@ -91,17 +91,17 @@ This package can be added to Spark using the `--packages` command line option. F
9191

9292
### Spark compiled with Scala 2.11
9393
```
94-
$SPARK_HOME/bin/spark-shell --packages za.co.absa.cobrix:spark-cobol_2.11:2.6.11
94+
$SPARK_HOME/bin/spark-shell --packages za.co.absa.cobrix:spark-cobol_2.11:2.7.0
9595
```
9696

9797
### Spark compiled with Scala 2.12
9898
```
99-
$SPARK_HOME/bin/spark-shell --packages za.co.absa.cobrix:spark-cobol_2.12:2.6.11
99+
$SPARK_HOME/bin/spark-shell --packages za.co.absa.cobrix:spark-cobol_2.12:2.7.0
100100
```
101101

102102
### Spark compiled with Scala 2.13
103103
```
104-
$SPARK_HOME/bin/spark-shell --packages za.co.absa.cobrix:spark-cobol_2.13:2.6.11
104+
$SPARK_HOME/bin/spark-shell --packages za.co.absa.cobrix:spark-cobol_2.13:2.7.0
105105
```
106106

107107
## Usage
@@ -238,17 +238,17 @@ to decode various binary formats.
238238

239239
The jars that you need to get are:
240240

241-
* spark-cobol_2.12-2.6.11.jar
242-
* cobol-parser_2.12-2.6.11.jar
241+
* spark-cobol_2.12-2.7.0.jar
242+
* cobol-parser_2.12-2.7.0.jar
243243
* scodec-core_2.12-1.10.3.jar
244244
* scodec-bits_2.12-1.1.4.jar
245245
* antlr4-runtime-4.8.jar
246246

247247
After that you can specify these jars in `spark-shell` command line. Here is an example:
248248
```
249-
$ spark-shell --packages za.co.absa.cobrix:spark-cobol_2.12:2.6.11
249+
$ spark-shell --packages za.co.absa.cobrix:spark-cobol_2.12:2.7.0
250250
or
251-
$ spark-shell --master yarn --deploy-mode client --driver-cores 4 --driver-memory 4G --jars spark-cobol_2.12-2.6.11.jar,cobol-parser_2.12-2.6.11.jar,scodec-core_2.12-1.10.3.jar,scodec-bits_2.12-1.1.4.jar,antlr4-runtime-4.8.jar
251+
$ spark-shell --master yarn --deploy-mode client --driver-cores 4 --driver-memory 4G --jars spark-cobol_2.12-2.7.0.jar,cobol-parser_2.12-2.7.0.jar,scodec-core_2.12-1.10.3.jar,scodec-bits_2.12-1.1.4.jar,antlr4-runtime-4.8.jar
252252
253253
Setting default log level to "WARN".
254254
To adjust logging level use sc.setLogLevel(newLevel). For SparkR, use setLogLevel(newLevel).
@@ -315,11 +315,11 @@ Creating an uber jar for Cobrix is very easy. Steps to build:
315315

316316
You can collect the uber jar of `spark-cobol` either at
317317
`spark-cobol/target/scala-2.11/` or in `spark-cobol/target/scala-2.12/` depending on the Scala version you used.
318-
The fat jar will have '-bundle' suffix. You can also download pre-built bundles from https://github.com/AbsaOSS/cobrix/releases/tag/v2.6.11
318+
The fat jar will have '-bundle' suffix. You can also download pre-built bundles from https://github.com/AbsaOSS/cobrix/releases/tag/v2.7.0
319319

320320
Then, run `spark-shell` or `spark-submit` adding the fat jar as the option.
321321
```sh
322-
$ spark-shell --jars spark-cobol_2.12_3.3-2.7.0-SNAPSHOT-bundle.jar
322+
$ spark-shell --jars spark-cobol_2.12_3.3-2.7.1-SNAPSHOT-bundle.jar
323323
```
324324

325325
> <b>A note for building and running tests on Windows</b>
@@ -1751,6 +1751,33 @@ at org.apache.hadoop.io.nativeio.NativeIO$POSIX.getStat(NativeIO.java:608)
17511751
A: Update hadoop dll to version 3.2.2 or newer.
17521752

17531753
## Changelog
1754+
- #### 2.7.0 released 8 April 2024.
1755+
- [#666](https://github.com/AbsaOSS/cobrix/issues/666) Added support for record length value mapping.
1756+
```scala
1757+
.option("record_format", "F")
1758+
.option("record_length_field", "FIELD_STR")
1759+
.option("record_length_map", """{"SEG1":100,"SEG2":200}""")
1760+
```
1761+
- [#669](https://github.com/AbsaOSS/cobrix/issues/669) Allow 'V' to be at the end of scaled PICs.
1762+
```cobol
1763+
10 SCALED-DECIMAL-FIELD PIC S9PPPV DISPLAY.
1764+
```
1765+
- [#672](https://github.com/AbsaOSS/cobrix/issues/672) Add the ability to parse copybooks with options normally passed to the `spark-cobol` Spark data source.
1766+
```scala
1767+
// Same options that you use for spark.read.format("cobol").option()
1768+
val options = Map("schema_retention_policy" -> "keep_original")
1769+
1770+
val cobolSchema = CobolSchema.fromSparkOptions(Seq(copybook), options)
1771+
val sparkSchema = cobolSchema.getSparkSchema.toString()
1772+
1773+
println(sparkSchema)
1774+
```
1775+
- [#674](https://github.com/AbsaOSS/cobrix/issues/674) Extended the usage of indexes for variable record length files with a record length field.
1776+
```scala
1777+
.option("record_length_field", "RECORD-LENGTH")
1778+
.option("enable_indexes", "true") // true by default so can me omitted
1779+
```
1780+
17541781
- #### 2.6.11 released 8 April 2024.
17551782
- [#659](https://github.com/AbsaOSS/cobrix/issues/659) Fixed record length option when record id generation is turned on.
17561783

@@ -1810,6 +1837,9 @@ A: Update hadoop dll to version 3.2.2 or newer.
18101837
- [#521](https://github.com/AbsaOSS/cobrix/issues/521) Fixed index generation and improved performance of variable
18111838
block length files processing (record_format='VB').
18121839

1840+
<details><summary>Older versions</summary>
1841+
<p>
1842+
18131843
- #### 2.5.1 released 24 August 2022.
18141844
- [#510](https://github.com/AbsaOSS/cobrix/issues/510) Fixed dropping of FILLER fields in Spack Schema if the FILLER has OCCURS of GROUPS.
18151845

@@ -1823,9 +1853,6 @@ A: Update hadoop dll to version 3.2.2 or newer.
18231853
- [#501](https://github.com/AbsaOSS/cobrix/issues/501) Fixed decimal field null detection when 'improved_null_detection' is turned on.
18241854
- [#502](https://github.com/AbsaOSS/cobrix/issues/502) Fixed parsing of scaled decimals that have a pattern similar to `SVP9(5)`.
18251855

1826-
<details><summary>Older versions</summary>
1827-
<p>
1828-
18291856
- #### 2.4.10 released 8 April 2022.
18301857
- [#481](https://github.com/AbsaOSS/cobrix/issues/481) ASCII control characters are now ignored instead of being replaced with spaces.
18311858
A new string trimming policy (`keep_all`) allows keeping all control characters in strings (including `0x00`).

cobol-converters/pom.xml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -22,7 +22,7 @@
2222
<parent>
2323
<groupId>za.co.absa.cobrix</groupId>
2424
<artifactId>cobrix_2.12</artifactId>
25-
<version>2.7.0-SNAPSHOT</version>
25+
<version>2.7.1-SNAPSHOT</version>
2626
<relativePath>../pom.xml</relativePath>
2727
</parent>
2828

cobol-parser/pom.xml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -22,7 +22,7 @@
2222
<parent>
2323
<groupId>za.co.absa.cobrix</groupId>
2424
<artifactId>cobrix_2.12</artifactId>
25-
<version>2.7.0-SNAPSHOT</version>
25+
<version>2.7.1-SNAPSHOT</version>
2626
<relativePath>../pom.xml</relativePath>
2727
</parent>
2828

examples/examples-collection/pom.xml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -31,7 +31,7 @@
3131
<scala.compat.version>2.11</scala.compat.version>
3232
<spark.version>2.4.8</spark.version>
3333
<specs.version>2.4.16</specs.version>
34-
<spark.cobol.version>2.6.11</spark.cobol.version>
34+
<spark.cobol.version>2.7.0</spark.cobol.version>
3535
</properties>
3636

3737
<dependencies>

examples/spark-cobol-app/build.sbt

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -20,7 +20,7 @@ ThisBuild / version := "0.1.0-SNAPSHOT"
2020
ThisBuild / scalaVersion := "2.12.17"
2121

2222
val sparkVersion = "3.3.2"
23-
val sparkCobolVersion = "2.6.11"
23+
val sparkCobolVersion = "2.7.0"
2424
val scalatestVersion = "3.2.14"
2525

2626
ThisBuild / libraryDependencies ++= Seq(

examples/spark-cobol-app/pom.xml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -31,7 +31,7 @@
3131
<scala.compat.version>2.12</scala.compat.version>
3232
<scalatest.version>3.2.14</scalatest.version>
3333
<spark.version>3.3.2</spark.version>
34-
<spark.cobol.version>2.6.11</spark.cobol.version>
34+
<spark.cobol.version>2.7.0</spark.cobol.version>
3535
</properties>
3636

3737
<dependencies>

examples/spark-cobol-s3-standalone/pom.xml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -32,7 +32,7 @@
3232
<scala.compat.version>2.11</scala.compat.version>
3333
<scalatest.version>3.2.3</scalatest.version>
3434
<spark.version>2.4.8</spark.version>
35-
<spark.cobol.version>2.6.11</spark.cobol.version>
35+
<spark.cobol.version>2.7.0</spark.cobol.version>
3636
<hadoop.version>3.2.4</hadoop.version>
3737
</properties>
3838

examples/spark-cobol-s3/pom.xml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -32,7 +32,7 @@
3232
<scala.compat.version>2.11</scala.compat.version>
3333
<scalatest.version>3.2.14</scalatest.version>
3434
<spark.version>2.4.8</spark.version>
35-
<spark.cobol.version>2.6.11</spark.cobol.version>
35+
<spark.cobol.version>2.7.0</spark.cobol.version>
3636
</properties>
3737

3838
<dependencies>

pom.xml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -22,7 +22,7 @@
2222
<groupId>za.co.absa.cobrix</groupId>
2323
<artifactId>cobrix_2.12</artifactId>
2424

25-
<version>2.7.0-SNAPSHOT</version>
25+
<version>2.7.1-SNAPSHOT</version>
2626

2727
<packaging>pom</packaging>
2828

spark-cobol/pom.xml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -22,7 +22,7 @@
2222
<parent>
2323
<groupId>za.co.absa.cobrix</groupId>
2424
<artifactId>cobrix_2.12</artifactId>
25-
<version>2.7.0-SNAPSHOT</version>
25+
<version>2.7.1-SNAPSHOT</version>
2626
<relativePath>../pom.xml</relativePath>
2727
</parent>
2828

version.sbt

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1 +1 @@
1-
ThisBuild / version := "2.7.0-SNAPSHOT"
1+
ThisBuild / version := "2.7.1-SNAPSHOT"

0 commit comments

Comments
 (0)