Skip to content

5th edition of 'Combination_of_Concern' - Dataset Source – ‘OpenDataSUS – Registros de Vacinacao Covid 19’ – status on 31 Aug 2021. Personal work done beyond the scope of Alura Bootcamp and courses.

Notifications You must be signed in to change notification settings

amf60/Vac_Covid19_BR_SC_Combinations_of_Concern_5

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 

Repository files navigation

Vac_Covid19_BR_SC_Combinations_of_Concern_5

  • Clinical Data Scientist: Aldir MEDEIROS FILHO
  • FLORIANOPOLIS, Santa Catarina(SC), Brasil
  • 02 Sep 2021

Introduction:

This is the 5th monthly edition of my personal project named 'Combinations_of_Concern - CoC'. The first 4 were initiated and developped during my attendance to Alura Bootcamp Applied Data Science II (Brazilian Winter 2021) that was done in parallel to a sequence of 32 Alura courses focused exclusively on the Python language between 01 May and 30 Aug 2021.

This 5th edition is a continuation and extension of the exploratory work I already done during the first four editions, when I reviewed the OpenDataSus registers of covid19 vaccination in the Brazilian State of Santa Catarina(SC).

For the first two editions, due to my limited knowledge of Python at that time I used a mix of terminal command line, Python&Colab and Excel, one for each of the 3 initial phases of the project. From the 3rd edition onward the learnings from the Alura courses helped to deliver all the 3 phases in one Colab notebook.

The main findings until here were:

From February to end June 2021 a constant trend of ~ 12% of registers for which we can not confirm which vaccine an individual received when I cross-checked 3 variables from the master dataset. I named this troublesome combinations as 'Combinations of Concern' - CoC (step 5 here after for definition).

At the end of July we noticed a reduction to ~ 10% of total of registrations responding to my definition of 'Combination of Concern'.

This 5th edition shows that at end of August 2021 (checked on 02 Sep 21) the monthly trend on the proportion of 'Combinations of Concern' continues in the early teens (11% end August 2021).

##AMF: Technical Warning: For this 5th edition at early September 2021, due to the significant increase on the rows in the raw dataset(sc0209_07am), I was obliged to subscribe to Google Colab Pro in order to have a connection to GPU and high-RAM runtime available. This information is important to know about the limitations of the free version of Google Colab.

I welcome any comments about errors (I am sure there are), suggestions for improvement and collaboration to perform the same investigation for other Brazilian states.

Best Regards

Aldir Medeiros Filho

Dataset Source – OpenDataSus - Registros de Vacinação Covid19 - Dados SC – Santa Catarina – Brasil

For this edition 5 the dataset was downloaded from the central governement database on 02 Sep 2021 at 07:00 am

Editions
'Combinations_of_Concern - CoC'
github links
Published

early
raw file Raw size columns rows
(registrations)
new
registrations
Total nr
CoC's registrations
CoC
% per total
1

2
Jun 21 sc3005_05am.csv 1.11 GB 34 2.059.426 259.507 12 %
3 Jul 21 sc0207_07am.csv 1.71 GB 34 3.133.344 1.116.119 368.654 12 %
4 Aug 21 sc0408_10am.csv 2.6 GB 34 4.854.329 1.720.985 576.103 12 %
5 Sep 21 sc0209_07am.csv 3.66 GB 34 6.822.321 1.967.992 767.214 11%

Covid-19 vaccination records in the Brazilian state of Santa Catarina

For information only, here after all the central government official information for the Brazilian State of Santa Catarina on 02 Sep 2021.

Screen Shot 2021-09-02 at 16 46 43

About

5th edition of 'Combination_of_Concern' - Dataset Source – ‘OpenDataSUS – Registros de Vacinacao Covid 19’ – status on 31 Aug 2021. Personal work done beyond the scope of Alura Bootcamp and courses.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published