Hi Hoang, I work in security and get these reports from some of my security tools in .csv format. I've written a script that goes through the raw data and when I give it the "Name" of an issue, it outputs all the ip addresses that are affected by that issue, dedups to only give me unique entries because there are multiples per issue, then gives me the number of affected endpoints. The problem is that at the moment, I have to go one by one for each issue name. What I'd like is to have 2 .csv files, the original (A.csv) with all the data and a second one (B.csv) where I have copied just the "Name" column and deduped it already so the program would recursively run a version of what I've already made for each "Name" and add the affected IPs and count to 2 columns in B.csv. I've attached the .csv files and the python code I have running now.
To be more clear:
use "Name" column from B.csv to run the type of query that CSVParserV3.py is doing on A.csv.
Ouput affected Hosts and a count of affected hosts into rows B and C of B.csv.