Machine Learning Projects
Football - Web Scraper
(Github)
Development
Start: 2021.
End: In Progress
Project description
Python files to extract online player data from the database of “SoFifa” and “SoccerWiki” sites into .csv files, to be used in my Football Manager game.
The file takes data from the sites “SoFifa” and “SoccerWiki” and using Python’s BeautifulSoup looks for Html tags with the necessary information of each player such as name, age, position and potential to save in a “.csv” file. Finished, your process generated files can be imported and used in the football manager application that I am developing with more than 400 teams always updated.
Football Champions - Web Scraper (Wikipedia)
Development
Start: 06/2022.
End: In Progress
Project description
In this project I take data from all the football leagues in the world using Wikipedia and classify the teams in their respective positions. With that I can make predictions of the future positions of the clubs among other analyses.
Of course, the project is missing data as there are many leagues with confusing structures like Apertura and Clausura and knockout championships that have no defined rankings.
All Football Players Stats in History - Web Scraper (Besoccer)
(Github)
Development
Start: 12/2022.
End: 12/2022.
Project description
Project where I get data from the squads of hundreds of football teams from the besoccer website, since 1900. Then I group all this data into a .csv. There is even a jupyter notebook with a quick analysis of existing data.
Streamlit Python - ML Website
(Website)
Development
Start: 12/2022.
End: 12/2022.
Project description
Project for viewing player data with python, through an interactive and online way through the Streamlit platform. It contains data on the greatest players of all time, such as goals scored, assists, games, among other parameters, making it possible to compare the best players in history.
Football Match prediction
(Website)
Development
Start: 10/2022.
End: 10/2022.
Project description
Using Python, I use a dataset with information about the teams, home club and retrospective of the last 5 matches. From there, I filter the dataset in train_set and test_set and apply feature scaling and the random forest method in sklearn’s train_set to obtain a prediction of the results. By modifying the hyperparameters and observing the Confusion Matrix, Recall, Accuracy and F1-Score metrics, the algorithm was improved for better results.
At the end, a 3D graph is generated comparing the real result with the expected, and with another test dataset from a new round of the Brasileirão, predictions are generated.
The algorithm is still imprecise, both due to the unpredictability of the results and the small dataset with little information.
My Music Preferences Analysis - Dashboard
(Github) (Tableau Link) (Gephi PDF)
Development
Start: 06/2023.
End: 06/2023.
Project description
Analysis of my songs filtering the data using Python. Obtaining genres from the Last.fm API and visualization of the results using Gephi, Tableau.
Reinforcement Learning with Q-Learning
(Github)
Development
Start: 02/2023.
End: 02/2023.
Project Description
Deep Q-learning experiment to simulate a player hitting penalties. At first a circle (player) would have to hit a ball in the middle of the screen and make the ball cross a line (goal). However, I realized that the algorithm was not appropriate because it deals better with finite spaces, such as boards. For continuous spaces other more recent algorithms must be used.
Uncertainty in Neural Networks
(Github)
Development
Start: 05/2023.
End: 05/2023.
Project Description
Experimental application of Bayesian Neural Networks and other methods to get the uncertainty and the std. dev. in point predictions.