Available for Opportunities

Mustafa GUL

Data Professional

Transforming raw data into actionable insights through pipeline development, analytics, and machine learning solutions.

Data Engineering

ETL Pipelines, Airflow, PySpark

Machine Learning

Predictive Models, LLM Integration

Location

Belgium • EU Work Authorization

8+
Projects
50K+
Records
85%
Efficiency

About Me

Recent Junior Data Engineer Intern at MinersAI with hands-on experience across the full data lifecycle: ETL pipelines, exploratory analysis, ML model development, and workflow automation. My philosophy background brings exceptional problem-solving skills and critical thinking to data challenges. Currently expanding expertise through continuous learning while building production-ready solutions.

Education

Master's in Philosophy

Istanbul MSGSU, Turkey (2020-2022)

Bachelor of Philosophy

Ankara YBU, Turkey (2013-2018)

Minors: Psychology, Sociology

Languages

EnglishB2
FrenchB1

Interests

Reading
Chess
Cinema
Cycling

Work Authorization

Fully eligible to work in Belgium and EU

Technical Skills

Comprehensive technical expertise spanning the entire data lifecycle, from pipeline development to ML deployment, with active learning through Coursera and DataCamp.

Programming & Analysis

Python (Pandas, NumPy)90%
SQL (PostgreSQL, MySQL)90%
R (Statistical Analysis)75%
JavaScript70%

Data Engineering

Apache Spark & PySpark80%
Airflow75%
n8n Automation80%
Docker75%
ETL Pipeline Design85%

Cloud & Databases

AWS (S3, RDS, Lambda)70%
Google Cloud (BigQuery)65%
PostgreSQL & MySQL85%
MongoDB & Redis70%

Machine Learning & AI

Scikit-Learn & TensorFlow80%
LangChain & OpenAI API75%
Feature Engineering85%
Model Deployment75%

Data Analysis & Viz

Exploratory Data Analysis90%
Statistical Testing80%
Matplotlib & Seaborn85%
Streamlit & Plotly80%
Power BI65%

Tools & Technologies

Git & CI/CD80%
GeoPandas75%
Web Scraping85%
OCR & LLM Integration75%
FastAPI & Flask80%

Professional Experience

Hands-on experience across the full data lifecycle, from ETL pipeline development to ML model deployment, with proven impact in production environments and real-world data challenges.

Internship
Jan 2025 - Mar 2025
Belgium

Junior Data Engineer Intern

MinersAI

Working on advanced data engineering solutions for geological data processing and AI integration.

Designed and optimized ETL pipelines for geological data processing with Python and PySpark

Performed exploratory data analysis on 50K+ geological records, identifying key patterns and anomalies

Integrated LLMs to convert unstructured geological text into structured formats, improving data accessibility by 85%

Implemented OCR + LLM solutions for extracting data from geological PDFs and maps

Structured geospatial and vector data for AI/ML analysis using Geopandas

Created dashboards and visualizations for data quality monitoring

Training
May 2024 - Dec 2024
Belgium

Data Engineer Trainee

BeCode

Completed intensive 7-month bootcamp covering data engineering, analytics, and machine learning.

Built 8+ end-to-end projects: data pipelines, ML models, dashboards, and automated workflows

Hands-on experience with large-scale datasets (1M+ records) and modern data stack technologies

Conducted exploratory data analysis, feature engineering, and model evaluation

Developed expertise in Python, SQL, PySpark, Airflow, and cloud platforms

Mastered data pipeline development, ETL processes, and workflow automation

Education
2021 - 2022
Turkey

P4C Instructor/Mentor

Private Educational Institution

Teaching critical thinking through Philosophy for Children (P4C) methodology.

Designed and delivered philosophy curricula for students of various age groups

Developed critical thinking and analytical skills in students through Socratic dialogue

Created engaging educational materials and interactive learning experiences

Mentored students in developing logical reasoning and ethical thinking

Fostered inclusive classroom environments promoting intellectual curiosity

Volunteering
2022 - 2023
Belgium

International Volunteer

European Solidarity Corps (ESC)

Animator organizing projects based on social inclusion and integration for diverse communities.

Organized and facilitated community integration events for international participants

Developed intercultural communication skills while working with diverse groups

Created educational workshops focused on social inclusion and cultural exchange

Coordinated logistics for large-scale community projects and events

Built strong networks within international volunteer communities

Featured Projects

Production-ready solutions demonstrating expertise in data engineering, machine learning, and AI integration with measurable impact and real-world applications.

AI/ML
Completed

AI-Powered SQL Query Generator

Natural language to SQL converter enabling non-technical users to query databases using plain English. Features 95% query accuracy with <2s response time.

Key Features:

  • Natural language processing for SQL generation
  • 95% query accuracy with sub-2-second response time
  • User-friendly Streamlit interface
  • Support for complex JOIN operations and aggregations
PythonOpenAI APIStreamlitDatabase Design
Machine Learning
Completed

Wine Recommendation System

ML-based recommendation engine using collaborative filtering to suggest wines based on user preferences. Trained on 10,000+ wine records with 87% prediction accuracy.

Key Features:

  • Collaborative filtering algorithm implementation
  • 87% prediction accuracy on test dataset
  • RESTful API with Flask for integration
  • Trained on dataset of 10,000+ wine records
PythonScikit-LearnFlaskCollaborative Filtering
Data Science
Completed

Real Estate Price Prediction

End-to-end ML pipeline for predicting real estate prices with advanced feature engineering. Achieved 92% R² score with production-ready API deployment on AWS.

Key Features:

  • Advanced feature engineering and selection
  • 92% R² score on validation dataset
  • Production-ready API with <200ms response time
  • Containerized deployment with Docker on AWS
Machine LearningAWSDockerFeature Engineering
AI/NLP
Completed

Document Q&A System

AI-powered document analysis system using semantic search and LLMs. Capable of processing 100+ pages in 30 seconds with 92% accuracy.

Key Features:

  • Semantic search with vector embeddings
  • Processes 100+ pages in under 30 seconds
  • 92% accuracy on question-answering tasks
  • Integration with multiple LLM providers
NLPLangChainVector EmbeddingsPython
Data Engineering
Completed

Data Automation Workflows

Intelligent automation workflows using n8n for data processing tasks. Reduced manual data processing time by 70% through automated pipelines.

Key Features:

  • Automated data collection from multiple sources
  • 70% reduction in manual processing time
  • Error handling and retry mechanisms
  • Real-time monitoring and alerting
n8nWorkflow AutomationAPI IntegrationPython
Web Development
Completed

Temporary Email Service

API-based disposable email generation system with 99.9% uptime. Handles 1000+ daily requests with robust error handling.

Key Features:

  • RESTful API with FastAPI framework
  • 99.9% uptime with error handling
  • Handles 1000+ requests per day
  • Comprehensive API documentation
FastAPIWeb ScrapingPythonAPI Design

Open to New Opportunities

Currently seeking Data Analyst, Data Engineer, Junior Data Scientist, or ML Engineer roles in Belgium. Available for full-time positions, freelance projects, and technical collaborations.

Let's Connect

Get In Touch

Currently seeking Data Analyst, Data Engineer, Junior Data Scientist, or ML Engineer roles in Belgium. Available for full-time positions, freelance projects, and technical collaborations.

Contact Information

Email

mstfgul00@gmail.com

Phone

+32-467-83-9465

Location

Marche-en-Famenne, BE

Follow Me

Send a Message

Ready to Start a Conversation?

Open to discussing data engineering projects, analytics opportunities, ML collaborations, or any technical challenges involving data. Location: Marche-en-Famenne, Belgium (EU work authorization).