Marlon Rodríguez Flor

Hi, my name is

Marlon.

Data Scientist

Computer Science Engineer

Passionate about developing artificial intelligence models that uncover hidden patterns and insights within data.

Resume

About Me

I am a Data Scientist with a Master’s degree in Data Science from the Autonomous University of Madrid, where my research focused on applying artificial intelligence to recommendation systems, specifically tackling challenges in Multimodal Extreme Multi-Label Classification (XMC). I also hold an Engineering degree in Computer Science from San Francisco de Quito University.

Throughout my career, I have developed and deployed machine learning models to optimize decision-making, enhance predictive capabilities, and drive business value. At Banco Solidario S.A., I led data science initiatives, applying machine learning and statistical models to improve customer segmentation, risk assessment, and sales efficiency. My work contributed to streamlining business processes and increasing operational effectiveness through AI-driven insights.

I have extensive experience with Transformer architectures, having applied them in-depth during my Master’s thesis. Additionally, I have worked with a wide range of libraries, including PyTorch, Hugging Face, Scikit-Learn, NumPy, and Pandas/Polars among others, to develop scalable and efficient machine learning models. My expertise covers Natural Language Processing (NLP), Computer Vision, Biometrics, Recommendation Systems, and Reinforcement Learning.

Here are a few technologies I've been working with recently:

Python
R
Java
SQL
Spark
C++
Dart
C#
CSS
HTML
LaTeX

Education

Master's Degree in Data Science

Universidad Autónoma de Madrid (UAM)

Sep 2023 - Feb 2025

GPA: 8.03 out of 10

Master’s Thesis: Multimodal Extreme Multi-Label Classification Under Resource Limitations (Grade: 9.5/10).

Developed a novel, resource-efficient multimodal architecture for Extreme Multi-Label Classification (XMC), integrating text and image modalities using an early fusion model based on transformers. The proposed architecture remains compatible with state-of-the-art XMC methods such as DEXA and NGAME, enhancing classification performance while ensuring computational efficiency and scalability. Extensive experiments on the MM-AmazonTitles-300K benchmark demonstrated that our approach outperforms existing methods, setting a new state-of-the-art in multimodal XMC.

Extracurricular Activities:

Delegate of the Master’s Degree in Data Science.

Relevant coursework:

Advanced Methods in Machine Learning.
Advanced Methods in Statistics.
Deep Learning for Biometric Information Processing.
Deep Learning for Signal Processing Image and Video.
Large-Scale Data Processing.
Natural Language Processing (NLP).
Reinforcement Learning.
Temporal Information Processing.
Unstructured Information.
Data Management.

Bachelor’s Degree in Computer Science Engineering

Universidad San Francisco de Quito (USFQ)

Aug 2017 - Jun 2021

GPA: 2.94 out of 4

Degree Thesis: Path Planning Optimization in SDN Using Machine Learning Techniques (Grade: A).

Developed a machine learning-based approach to optimize path planning in Software-Defined Networks (SDN), improving network QoS. Formulated path selection as a multi-class classification problem and evaluated multiple classifiers. The best-performing model, a support vector machine, outperformed alternative methods
Published in the 2021 IEEE Fifth Ecuador Technical Chapters Meeting (ETCM) https://ieeexplore.ieee.org/document/9590749.

Relevant coursework:

Artificial Intelligence.
Calculus (I, II and III).
Data Structures and Algorithms.
Data Mining.
Databases.
Linear Algebra.
Programming (I, II and III).
Probability and Statistics.
Systems Design.
Projects: Management and Analysis.

Experience

Data Analytics Officer

Banco Solidario S.A.

Jan 2022 - Ago 2023

Increased the balance in savings accounts by USD 200,000 by identifying over 38,000 potential clients who increased their balance by more than USD 260 through the implementation of an XGBoost model, with 7% of them achieving this increase.
Lead and work closely with product owners to develop successful projects, communicating findings and results clearly and effectively to non-technical audiences.

Data Analytics Technician

Banco Solidario S.A.

Jul 2021 - Dic 2021

Increased the number of downloads of Banco Solidario’s mobile app by 30% and reduce the cost per download by 22% by implementing a Random Forest model to identify potential customers for digitalization, also improving customer segmentation.
Enhanced customer experience and boost sales by developing an interactive dashboard to monitor the efficiency of time in sales and services for commercial advisors at Banco Solidario, enabling targeted actions at each branch.

Publications

Path Planning Optimization in SDN Using Machine Learning Techniques

2021 IEEE Fifth Ecuador Technical Chapters Meeting (ETCM)

Oct 2021

Available on IEEE Xplore

Certificates

IELTS Academic

IELTS

Overall Band Score: 6.5 (Mar 2023)

Check the Certificate

Program in Statistics and Data Science

MITx

6.419x: Data Analysis: Statistical Modeling and Computation in Applications. (Sep 2022)
18.6501x: Fundamentals of Statistics. (May 2022)
6.86x: Machine Learning with Python-From Linear Models to Deep Learning. (Nov 2021)
6.431x: Probability - The Science of Uncertainty and Data. (Sep 2021)

Achievements

Ideatón 2023 of BANCO SOLIDARIO S.A.

We achieved second place for the solution proposed to a strategic challenge of the bank.

Relevant Projects

Python PyTorch Torchvision Semantic Segmentation Albumentations Computer Vision

Image Segmentation with U-Net and DeepLabV3 on CUB-200-2011 and Pascal VOC2012

This project explores semantic segmentation on CUB-200-2011 and Pascal VOC 2012 using U-Net and DeepLabV3 with PyTorch, combining data augmentation, class balancing, and hybrid loss functions.

Available on GitHub

Python Scikit-learn Machine Learning (ML)

Gradient Boosting Implementation

This project features an implementation of the Gradient Boosting algorithm, an ensemble method that combines multiple decision trees (stumps). It utilizes gradient descent optimization to minimize the loss function. The collective contributions of all weak models (stumps) result in a robust predictive model.

Available on GitHub

Python PyTorch Computer Vision

Multiple Object Tracking for Video Sequences

This project addresses the task of multiple object tracking (MOT), specifically focusing on tracking people walking in video sequences. The base model for detection and tracking is enhanced using advanced techniques to improve performance. The dataset used is MOT16, which contains various scenarios for individual detection.

Available on GitHub

Python WordNet Beautiful Soup Natural Language Processing (NLP)

Analysis of Emotions in Classic Novels

This project uses Natural Language Processing (NLP) to analyze emotions in literary texts from Project Gutenberg, aiming to identify and quantify emotions through advanced NLP methods like sentiment analysis and text information extraction.

Available on GitHub

Python Reinforcement Learning (RL)

Q Learning and SARSA Implementation

This project demonstrates the implementation of two reinforcement learning algorithms: Q Learning and SARSA. These algorithms are evaluated across various grid world maps to analyze their performance and behavior.

Available on GitHub

Python TensorFlow Keras Recommendation Systems

Recommendation: Matrix Factorization and Deep-Learning

This project demonstrates collaborative filtering on the movie ratings dataset using matrix factorization combined with neural networks.

Available on GitHub

Get in Touch

My inbox is always open. Whether you have a question or just want to say hi, I’ll try my best to get back to you!

Mail me