Skip to content
View Stefano-Trovato-89's full-sized avatar

Block or report Stefano-Trovato-89

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stefano-Trovato-89/README.md

👋 Hi, I'm Stefano Trovato 🧑‍🔬
A biologist turned Data Scientist, passionate about applying AI to real-world challenges in science, health, and the environment.

🎓 I’m enrolled in multiple advanced programs:

  • Master in Data Science (completed, with 11+ end-to-end projects in ML, NLP, Big Data and explainability)
  • Master in Data Engineering (Spark, Databricks, SQL, NoSQL, Cloud)
  • AI Engineering & AI Development (Deep Learning, NLP, pipelines)

⚙️ My stack includes Python (Pandas, Scikit-learn, TensorFlow, PyTorch), SQL, Spark, Tableau – with growing experience in Docker, Airflow, and cloud tools (AWS, Azure, Snowflake).

💡 I value practicality and impact, turning data into useful and innovative solutions.

🎨 Outside of work, I enjoy drawing and swimming, to stay creative and focused.

📫 Connect with me on LinkedIn

Pinned Loading

  1. spam-detection-nlp spam-detection-nlp Public

    NLP project for spam detection using Naive Bayes, topic modeling, semantic distance analysis, and organization extraction to improve email classification and business intelligence.

    Jupyter Notebook

  2. toxic-comments-filter toxic-comments-filter Public

    Deep Learning project for social media moderation using LSTM/GRU models to classify comments into six toxicity categories (toxic, obscene, threat, insult, identity hate), enabling real-time detecti…

    Jupyter Notebook

  3. creditworthiness-prediction creditworthiness-prediction Public

    Supervised ML pipeline for predicting customer creditworthiness in credit-card approval. Covers EDA, feature engineering, imbalance handling, and multiple models (LogReg, DT, RF, GBT, Balanced RF, …

    Jupyter Notebook

  4. wikipedia-bigdata-analysis wikipedia-bigdata-analysis Public

    Big Data analysis and machine learning classification of Wikipedia articles in Databricks, including EDA with descriptive statistics and word clouds, plus supervised models for automatic article ca…

    Jupyter Notebook

  5. banking-customer-features-sql banking-customer-features-sql Public

    SQL project to build a denormalized feature table of banking customers, including behavioral indicators from multiple tables, for use in supervised machine learning models.

  6. inferential-statistics-project inferential-statistics-project Public

    Statistical modeling project using R to predict newborn birth weight based on clinical variables, improving high-risk pregnancy management and hospital resource optimization.

    HTML