Skip to content
View vitor-faria's full-sized avatar
Block or Report

Block or report vitor-faria

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
vitor-faria/README.md

Hi, I'm Vitor 👋

Github Badge Linkedin Badge

Brazilian-Italian ISFJ of generalist profile who has shifted from Chemical Engineering to the Data career in tech companies. Over 3 years of work experience in Data Science, Business Intelligence and Analytics Engineering, with the purpose of making data useful for better decision making.

Currently living in Germany, and engaged in the Master Program of Data Science at the University of Mannheim.

👨‍🔬 Projects, Notebooks & Articles

Wikinator: an Akinator-like game based on DBPedia's Knowledge Graph (EN/🇺🇸)

Team project for the module of Semantic Web Technologies / Knowledge Graphs, as part of the program Mannheim Master in Data Science of the University of Mannheim, during the winter semester of 2022/23. It is a game application inspired in Akinator, that tries to guess which real world person or fictional character the players are thinking of, but relying solely on data available in DBPedia's Knowledge Graph and using only SPARQL queries for data extraction.

Web Structure Mining: Paper importance prediction using Graph Neural Networks (EN/🇺🇸)

Team project for the module of Web Mining, as part of the program Mannheim Master in Data Science of the University of Mannheim, during the summer semester of 2022. In this work, Graph Neural Networks and traditional Machine Learning approaches were exploited to predict the importance a paper will have once it is published, given the citation network.

Information Retrieval: Learning-To-Rank with embeddings for document retrieval (EN/🇺🇸)

Team project for the module of Information Retrieval and Web Search, as part of the program Mannheim Master in Data Science of the University of Mannheim, during the summer semester of 2022. In this work, traditional IR approaches such as TF-IDF and BM-25, as well as recent embedding techniques, such as GloVe, BERT and BART, were exploited in a Learning to Rank algorithm for document retrieval.

Sentiment Analysis of headlines about US presidents in their first year of mandate (EN/🇺🇸)

Final project for the module of Automated Media Content Analysis, as part of the program Mannheim Master in Data Science of the University of Mannheim, during the winter semester of 2021/22. The sentiment of headlines and snippets from The New York Times articles concerning two United States presidents (Donald Trump and Joe Biden) in their first year of government were analyzed with NLP techniques.

TeraBeer: a recommender system of brazilian craft beers (PT/🇧🇷)

Recommending system of brazilian craft beers based on the user's taste for food and beverages. This was the final project of Tera's bootcamp of Data Science & Machine Learning, developed by me and a group of 4 other students and presented in a Demoday's panel of experts.

Default Prediction App (EN/🇺🇸)

Simple streamlit application to interact with a ML classification model I created based on PKDD'99 default financial data of a czech bank. This was an exercise proposed by Tera's Bootcamp during the Deploy module.

👨‍💻 Work Experience

Data Scientist @ Jusbrasil (2021-current)

Jusbrasil is the leading legal website in Brazil, where people can find laws, lawsuits, court decisions and the legal reputation of organizations and persons, and aims to be widely recognized as the Single Source of Truth for legal information. The company uses web-crawling over hundreds of sources, machine learning algorithms and NLP to build the richest legal document collection, and distributes this information with unprecedented precision and recall in its search engine and other products.

Currently working as a Data Scientist in our Knowledge Base team, my aim is to add intelligence to Jusbrasil's large digital asset/Knowledge Graph with state-of-the-art text processing techniques.

Prior: Data Analyst
As Full-Stack Data Analyst, my aim was to optimize our analytical environment for generation of insights, data-driven decision-making and predictive analytics. Main activities were:
  • building core datasets in BigQuery, to be used by Business teams, Product squads and other Data Analysts;
  • creating interactive dashboards and advanced SQL questions in Metabase to scale behavioral analytics;
  • orchestrating data workflows in Airflow, such as ETL pipelines, batch predictions of Machine Learning models and table snapshots.

Data Analyst @ Platos Educação (2019-2021)

About this experience
Platos was the part of the holding Cogna Educação, one of the world's largest educational organizations, that served the B2B market of Higher Education. The company offered, under the brand Saraiva Educação, a range of educational services and solutions for universities across the country, such as digital libraries, digital learning environments and online preparation for nation-wide exams. In our Data Science, Engineering and Analytics team, I:
  • built Machine Learning models, including an end-to-end book recommender system;
  • created interactive dashboards for Product, Marketing, Customer Success and Sales teams using Metabase as BI & ad-hoc platform;
  • automated reports that were sent to client universities using Python (Django, Pandas and Matplotlib);
  • created, maintained and optimized ETL pipelines to ingest data in our Google BigQuery Data Warehouse;
  • provided on-demand business and product insights based on Exploratory Data Analysis;
  • played a protagonist role in spreading the data-driven culture along the company.

BI Intern @ Somos Educação (2018-2019)

About this experience
SOMOS was (when aquired by Kroton to further become Cogna Educação) the largest group of basic education in Brazil and impacted more than 27 million students across Brazil through various brands. My role as BI Intern in the Business Unit of Solutions for Higher and Technical Education was to:
  • create BI dashboards in order to keep track of the top OKR's;
  • provide business insights to the leaders;
  • use Data Storytelling to build visuals and slides for Radar meetings;
  • develop processes to improve Knowledge Management.

Junior Entrepreneur @ Mult Jr (2015-2017)

About this experience

Mult Jr is a Junior Enterprise voluntarily managed by Chemical Engineering students that provides solutions under the technical guidance of Professors from the University. And it is where I fell in love with Excel spreadsheets and Data Analysis, while working in the Financial, HR and IT departments.

CFO

  • Legal representation of the JE.
  • Manage an annual budget of ~R$ 50k.
  • Lead a 5 member team.
  • Ensure the execution of financial, accounting and legal processes, such as cash flow and drafting of contracts.
  • Define Pricing strategies.

IT Coordinator

  • Lead a 7 member team.
  • Maintain the functioning of the site and other virtual tools.
  • Develop spreadsheets and applications for other teams.
  • Provide adequate training in virtual tools such as Excel, VBA and PowerPoint.

HR analyst

  • Recruitment and selection.
  • Coach other members.
  • Analyze organizational climate.
  • Evaluate member performance.

👨‍🎓 Education

Master in Data Science @ Universität Mannheim 🇩🇪

The Mannheim Master in Data Science is an interdisciplinary program of study that is unique in Germany. It merges the fields of Business Informatics, Sociology, Political Science, and Mathematics and teaches students how to collect, organize, analyze, and visualize large amounts of data using the appropriate tools and methods with a practice-oriented curriculum.

Completed modules
Team Projects

Data Science and Machine Learning Bootcamp @ Tera 🇧🇷

Tera is a modern school focused on Project Based Learning with Market experts of the Digital Economy. In this challenging 6-month bootcamp, me and my group developed and launched the TeraBeer project - a recommending system of brazilian craft beers based on the user's taste for food and beverages.

B.Sc. in Chemical Engineering @ UFMG 🇧🇷

The Bachelor of Science in Chemical Engineering at Universidade Federal de Minas Gerais graduates engineers capable of acting in the most diverse areas of the chemical industry, such as food & beverages, paper & pulp, petrochemical, among others. All the professors are PhDs in the university and the main focuses are in Transport Phenomena, Thermodynamics, Kinetics & Reactor Design and Unit Operations. During university years I was engaged in different extracurricular activities:

Automation & Simulation Summer School @ RTWH Aachen 🇩🇪

About this activity

The Summer Schools are courses provided by the International Academy of the RWTH Aachen University targeting Engineering students of outstanding academic performance from all over the world. The program of the 4-week Automation and Simulation course gather many activities, such as lectures and exercises about Nummerical Methods in Matlab and Robot Automation, classes about german language and culture, visits to state of the art german companies, excursions and intercultural training. The course took place during the month of July, 2019.

Academic Exchange @ FAU Erlangen 🇩🇪

About this activity

1 semester academic exchange at Friedrich-Alexander Universität through the program Minas Mundi (UFMG), from April 2018 to August 2018. All lessons were taught in German. Language courses: Deutsch Intensivkurs C1.1 (March 2018, 5 ECTS), Deutsch Allgemeinkurs C1 (April to July 2018, 5 ECTS) - Sprachzentrum.

Volunteer @ Equalizar

About this activity

Founded in 2012, Equalizar is a social project based at the Engineering School of UFMG that provides low-cost preparation for ENEM, the exam used to enter most public and private universities, helping vulnerable students from the public system to change their lives. Equalizar is totally managed by volunteers and helps +100 students every year. I worked voluntarily at Equalizar between 2014 and 2016 in different positions such as Math monitor, HR assistant and Communication director.

Scientific Initiation @ CDTN

About this activity

1-year Scientific Initiation at Centro de Desenvolvimento de Tecnologia Nuclear, working on the project "Obtaining Graphene and Graphene Oxide in Aqueous Environment for Contaminant Adsorption" together with doctoral students. The aim of the project is to optimize graphene extraction by the exfoliation method in the liquid phase using water as solvent and to study the use of graphene oxides to clean water contaminated with radioactive substances by the adsorption method.

🤾‍♂️ Hobbies

  • Brewing different styles of craft beer 🍺
  • Reading and discussing Fiction in Book Clubs 📚
  • Spending more time deciding what to watch next in Netflix than actually watching 🎥
  • Running, cycling and skipping 🏃

Pinned

  1. tera-beer-recommendations tera-beer-recommendations Public

    Forked from lnpsiqueira/tera_beeer_v2

    Python 2 1

  2. attrition-prevention attrition-prevention Public

    Exploratory Data Analysis and Predictive Modeling using Machine Learning algorithms to predict employee attrition up on IBM's Attrition Dataset. Notebook in brazilian portuguese.

    Jupyter Notebook

  3. default-prediction-app default-prediction-app Public

    Simple streamlit application to interact with a ML classification model based on PKDD'99 default financial data.

    Jupyter Notebook 1