Edit: ok, all files are going to be processed except the ones you set to ignore.
I meant to say that I'm only selecting one file to test if the processing would work again.
Since it's taking that long I'm assuming that all logs are being processed:
root@Expedition:/home/expedition# tail -f /tmp/error_logCoCo
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:file:/opt/Spark/extraLibraries/slf4j-nop-1.7.25.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in [jar:file:/opt/Spark/spark-2.4.3-bin-hadoop2.7/jars/slf4j-log4j12-1.7.16.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.helpers.NOPLoggerFactory]
---- CREATING SPARK Session:
warehouseLocation:/datastore/spark-warehouse
+------------+--------+--------------------+----+------------+
| fwSerial|panosver| csvpath|size|afterProcess|
+------------+--------+--------------------+----+------------+
|011901016881| 8.1.0|/PALogs/paloalto_...|4957| Compress|
|011901016881| 8.1.0|/PALogs/paloalto_...|4332| Compress|
|011901016881| 8.1.0|/PALogs/paloalto_...|4629| Compress|
|011901016881| 8.1.0|/PALogs/paloalto_...|3861| Compress|
|011901016881| 8.1.0|/PALogs/paloalto_...|4629| Compress|
|011901016881| 8.1.0|/PALogs/paloalto_...|3840| Compress|
|011901016881| 8.1.0|/PALogs/paloalto_...|5264| Compress|
|011901016881| 8.1.0|/PALogs/paloalto_...|3892| Compress|
|011901016881| 8.1.0|/PALogs/paloalto_...|3646| Compress|
|011901016881| 8.1.0|/PALogs/paloalto_...|4783| Compress|
|011901016881| 8.1.0|/PALogs/paloalto_...|4772| Compress|
|011901016881| 8.1.0|/PALogs/paloalto_...|4619| Compress|
|011901016881| 8.1.0|/PALogs/paloalto_...|5161| Compress|
|011901016881| 8.1.0|/PALogs/paloalto_...|4752| Compress|
|011901016881| 8.1.0|/PALogs/paloalto_...|4076| Compress|
|011901016881| 8.1.0|/PALogs/paloalto_...|4895| Compress|
|011901016881| 8.1.0|/PALogs/paloalto_...|3789| Compress|
|011901016881| 8.1.0|/PALogs/paloalto_...|4742| Compress|
|011901016881| 8.1.0|/PALogs/paloalto_...|5059| Compress|
|011901016881| 8.1.0|/PALogs/paloalto_...|5407| Compress|
+------------+--------+--------------------+----+------------+
Memory: 6373m
LogCollector&Compacter called with the following parameters:
Parameters for execution
Master[processes]:............ local[3]
Available RAM (MB):........... 6525952
User:......................... admin
debug:........................ false
Parameters for Job Connections
Task ID:...................... 152
My IP:........................ 10.3.1.30
Expedition IP:................ 10.3.1.30:3306
Time Zone:.................... Europe/Helsinki
dbUser (dbPassword):.......... root (************)
projectName:.................. demo
Parameters for Data Sources
App Categories (source):........ (Expedition)
CSV Files Path:................./tmp/1572951103_traffic_files.csv
Parquet output path:.......... file:///PALogs/connections.parquet
Temporary folder:............. /datastore
---- AppID DB LOAD:
Application Categories loading...
Application Categories loaded
+------------+--------+--------------------+----+------------+--------+---+---------------+
| fwSerial|panosver| csvpath|size|afterProcess| grouped|row|accumulatedSize|
+------------+--------+--------------------+----+------------+--------+---+---------------+
|011901016881| 8.1.0|/PALogs/paloalto_...|4957| Compress|grouping| 1| 4957.0|
|011901016881| 8.1.0|/PALogs/paloalto_...|4332| Compress|grouping| 2| 9289.0|
|011901016881| 8.1.0|/PALogs/paloalto_...|4629| Compress|grouping| 3| 13918.0|
|011901016881| 8.1.0|/PALogs/paloalto_...|3861| Compress|grouping| 4| 17779.0|
|011901016881| 8.1.0|/PALogs/paloalto_...|4629| Compress|grouping| 5| 22408.0|
|011901016881| 8.1.0|/PALogs/paloalto_...|3840| Compress|grouping| 6| 26248.0|
|011901016881| 8.1.0|/PALogs/paloalto_...|5264| Compress|grouping| 7| 31512.0|
|011901016881| 8.1.0|/PALogs/paloalto_...|3892| Compress|grouping| 8| 35404.0|
|011901016881| 8.1.0|/PALogs/paloalto_...|3646| Compress|grouping| 9| 39050.0|
|011901016881| 8.1.0|/PALogs/paloalto_...|4783| Compress|grouping| 10| 43833.0|
|011901016881| 8.1.0|/PALogs/paloalto_...|4772| Compress|grouping| 11| 48605.0|
|011901016881| 8.1.0|/PALogs/paloalto_...|4619| Compress|grouping| 12| 53224.0|
|011901016881| 8.1.0|/PALogs/paloalto_...|5161| Compress|grouping| 13| 58385.0|
|011901016881| 8.1.0|/PALogs/paloalto_...|4752| Compress|grouping| 14| 63137.0|
|011901016881| 8.1.0|/PALogs/paloalto_...|4076| Compress|grouping| 15| 67213.0|
|011901016881| 8.1.0|/PALogs/paloalto_...|4895| Compress|grouping| 16| 72108.0|
|011901016881| 8.1.0|/PALogs/paloalto_...|3789| Compress|grouping| 17| 75897.0|
|011901016881| 8.1.0|/PALogs/paloalto_...|4742| Compress|grouping| 18| 80639.0|
|011901016881| 8.1.0|/PALogs/paloalto_...|5059| Compress|grouping| 19| 85698.0|
|011901016881| 8.1.0|/PALogs/paloalto_...|5407| Compress|grouping| 20| 91105.0|
+------------+--------+--------------------+----+------------+--------+---+---------------+
Selection criteria: 0 < accumulatedSize and accumulatedSize <= 6525952
Processing from lowLimit:0 to highLimit:6525952 with StepLine:6525952
Few logs can fit in this batch:20
8.1.0:/PALogs/paloalto_1_traffic_2019_10_25_last_calendar_day.csv,/PALogs/paloalto_1_traffic_2019_11_01_last_calendar_day.csv,/PALogs/paloalto_1_traffic_2019_11_02_last_calendar_day.csv,/PALogs/paloalto_1_traffic_2019_10_31_last_calendar_day.csv,/PALogs/paloalto_1_traffic_2019_10_19_last_calendar_day.csv,/PALogs/paloalto_1_traffic_2019_10_17_last_calendar_day.csv,/PALogs/paloalto_1_traffic_2019_11_03_last_calendar_day.csv,/PALogs/paloalto_1_traffic_2019_10_28_last_calendar_day.csv,/PALogs/paloalto_1_traffic_2019_11_05_last_calendar_day.csv,/PALogs/paloalto_1_traffic_2019_10_27_last_calendar_day.csv,/PALogs/paloalto_1_traffic_2019_10_30_last_calendar_day.csv,/PALogs/paloalto_1_traffic_2019_10_22_last_calendar_day.csv,/PALogs/paloalto_1_traffic_2019_11_04_last_calendar_day.csv,/PALogs/paloalto_1_traffic_2019_10_18_last_calendar_day.csv,/PALogs/paloalto_1_traffic_2019_10_26_last_calendar_day.csv,/PALogs/paloalto_1_traffic_2019_10_21_last_calendar_day.csv,/PALogs/paloalto_1_traffic_2019_10_29_last_calendar_day.csv,/PALogs/paloalto_1_traffic_2019_10_20_last_calendar_day.csv,/PALogs/paloalto_1_traffic_2019_10_23_last_calendar_day.csv,/PALogs/paloalto_1_traffic_2019_10_24_last_calendar_day.csv
Logs of format 7.1.x NOT found
Logs of format 8.0.2 NOT found
Logs of format 8.1.0-beta17 NOT found
Logs of format 8.1.0 found
Logs of format 9.0.0 NOT found
Logs of format 9.1.0-beta NOT found
... View more