This repository is an initial pipeline for reading, processing, labelling and classifying unstructured annual reports of South African (SA) banks with the aim of identifying financial risk. It leveraged work by the Corporate Financial Information Environment-Final Report Structure Extractor (CFIE–FRSE) of El-Haj et al. which created a corpus of …
nlp
finance
machine-learning
natural-language-processing
dataset
south-africa
african
nlproc
bank-risk
dsfsi-datasets
-
Updated
Oct 26, 2023 - Jupyter Notebook