Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

csv file structure corrupted. #222

Open
liudonghua123 opened this issue Nov 21, 2024 · 1 comment
Open

csv file structure corrupted. #222

liudonghua123 opened this issue Nov 21, 2024 · 1 comment

Comments

@liudonghua123
Copy link

Hi, I run bash fetchData.sh start_date=0d0 end_date=1d1 to download the files. And I extract CallGraph/CallGraph_0.tar.gz to get CallGraph_0.csv.

I use csvq to parse the csv, then I got parse error around line 58755, the rpc_id of this line is 0.1.1.1,0.1.1.1 without quotes.

[root@ha-master-1 CallGraph]# csvq 'select `rpctype`,count(*) from `./CallGraph_0.csv` group by rpctype'
[L:1 C:32] data parse error in /root/code/clusterdata/cluster-trace-microservices-v2022/data/CallGraph/CallGraph_0.csv: line 58755, column 116: wrong number of fields in line
[root@ha-master-1 CallGraph]#

image

And I also find some other strange errors about the data.

image

@liudonghua123
Copy link
Author

I currently use sed -Ei 's/([0-9]+\.[0-9]+(\.[0-9]+)*,\s?)+[0-9]+(\.[0-9]+)*/"\0"/g' CallGraph_0.csv to add quotes around this column of data.

I am not sure if this is correct.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant