Skip to content

dsfsi/project-state-capture

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 

Repository files navigation

South African State Capture Commision Transcripts - Zondo Commission

Give Feedback 📑: DSFSI Resource Feedback Form

About State Capture Comission

The Judicial Commission of Inquiry into Allegations of State Capture, Corruption and Fraud in the Public Sector including Organs of State, better known as the Zondo Commission or State Capture Commission, is a public inquiry established in January 2018 by former President Jacob Zuma to investigate allegations of state capture, corruption, and fraud in the public sector in South Africa.[2][3]

Source: https://en.wikipedia.org/wiki/Zondo_Commission

About Dataset

We extracted plaintext versions of thhe published transcripts (from https://www.statecapture.org.za/site/transcripts. There is minimal clearning but we believe these can be sued for textual analysis.

file/folder description url
data/interim Folder with individuaual .txt files of extracted transcripts by day. /data/interim/
state-capture-transcripts-day-1-399.txt.zip zip file wiht all transcripts. state-capture-transcripts-day-1-399.txt.zip

TODOs

  • Clean up the data
  • Extract sentences
  • Tag conversations by who is talking (speaker)

Authors

  • Tsholofelo Gomba
  • Vukosi Marivate - @vukosi

See also the list of contributors who participated in this project.

Citation

TBA

License

Data is Licensed under CC 4.0 BY SA

Code is Licences under MIT License.