>python vaers_purging_info.py vaers_purging_info.py Comparing 2022-11-04 vs. 2022-11-11 for blanked out fields open 2022-11-04_VAERS_FLATTENED.csv ... Highest VAERS_ID 2498935 open 2022-11-11_VAERS_FLATTENED.csv ... Highest VAERS_ID 2505475 Fixing date formats Removing delims: symptom_entries Removing delims: VAX_TYPE VAX_MANU VAX_LOT VAX_DOSE_SERIES VAX_ROUTE VAX_SITE VAX_NAME VAX_DATE PRIOR_VAX df_11_cap is only up to max VAERS_ID in 04 1460604 VAERS_IDs in df_04 1458330 VAERS_IDs in df_11_cap 1458171 VAERS_IDs the same in both 1458171 VAERS_IDs in df_04_common 1458171 VAERS_IDs in df_11_common Writing df_04_common.csv Writing df_11_common.csv 159 delayed/gapfill Writing delayed/gapfills df_11_gapfill.csv 2433 deleted in 2022-11-11 Writing df_11_deleted.csv (in 2022-11-04 but not 2022-11-11) Reports Column Bytes 476230 SYMPTOM_TEXT 915155397 (915 MB) 288093 LAB_DATA 84979759 187640 HISTORY 20704413 476230 SPLTTYPE 11207732 45683 CUR_ILL 2479119 21762 VAX_SITE 25067 13437 VAX_ROUTE 15341 12536 VAX_LOT 13443 262 ALLERGIES 6932 56 VAX_DOSE_SERIES 59 0 CAGE_YR 0 0 OTHER_MEDS 0 0 PRIOR_VAX 0 0 STATE 0 0 VAX_MANU 0 0 VAX_NAME 0 0 VAX_TYPE 0 0 V_ADMINBY 0 0 symptom_entries 0 1034587262 total bytes blanked out (solely the fields that were emptied entirely) ... 1.034 GB Writing just the original VAERS reports in 2022-11-04 that had any fields blanked out in 2022-11-11) ... vaers_with_blankings_2022-11-04_to_2022-11-11.csv Done with vaers_purging_info.py at line 388, 2024-03-14 04:53:05.948607 - - - - - - - - - - - - - - - - - - - - - - - - D:\Users\garyha\Documents\_covid\_VAERS\build_original_records\purge>