Hi, I'm Arwa Elsawy
Junior Data Scientist · Software Engineer
I specialize in turning complex data into actionable insights and production-ready solutions. My experience spans predictive modeling, recommendation systems, web scraping, and deploying AI applications with Python, Odoo, Flask, and Hugging Face. I am passionate about solving real-world problems by bridging the gap between data science and practical business impact.
Skills: Python · SQL · Flask · TensorFlow · scikit-learn · Odoo · AWS · Docker
- Programming: Python, JavaScript, C++, C#, SQL.
- Frameworks & Libraries: Flask, React.js, Node.js, TensorFlow, Keras, scikit-learn, Pandas, OpenCV, BeautifulSoup.
- Databases: PostgreSQL, MySQL, MongoDB, SQLite.
- Data & Analytics: ETL, data wrangling, feature engineering, predictive modeling, time-series forecasting, data visualization.
- Cloud & DevOps: AWS (EC2, S3), Docker.
- Tools & Platforms: Odoo ERP, Power BI, Selenium, Git, VS Code, PyCharm, Hugging Face.
Problem
- Traditional inventory management in Odoo lacked accurate demand forecasting and efficient slotting.- This led to frequent stockouts and increased picking time.Approach
- Implemented AI-powered demand forecasting using Linear Regression with engineered time-series features (lag values, rolling averages, sales differences).- Designed a Genetic Algorithm-based slotting optimizer that allocates products to warehouse slots based on demand frequency, item compatibility, product dimensions, and proximity to dispatch zones.- Seamlessly integrated forecasting and slotting models into the Odoo ERP backend as custom modules with automated workflows.- Built and tested module functionalities using Katalon Studio to ensure reliability and accuracy.Results
- Achieved improved forecast accuracy compared to baseline methods, with Linear Regression outperforming alternative ML algorithms.- Optimized slotting significantly reduced average picker travel distance compared to other tested metaheuristic approaches across 3 different warehouse sizes.Tools & Technologies
- Python, Odoo, PostgreSQL, scikit-learn, pandas, FPGrowth, Genetic Algorithm, Linear Regression, PyCharm, Katalon Studio.

Churn Prediction for Telecom Customers
Problem
- Telecom companies face significant revenue losses when customers discontinue their services (customer churn).- Accurately predicting churn can help businesses identify at-risk customers and take preventive actions.Approach
- Data Preprocessing: Cleaned and transformed the dataset (encoding categorical variables, scaling, handling missing values).- Feature Engineering: Created meaningful features from customer demographics and service usage.- Modeling: Trained and compared multiple machine learning algorithms including Logistic Regression, Random Forest, and XGBoost.- Evaluation: Used accuracy, precision, recall, and F1-score to evaluate models, with a focus on recall.- Deployment: Built an interactive Gradio interface for real-time predictions and deployed it on Hugging Face Spaces.Results
- Logistic Regression achieved the best balance across metrics:- Accuracy: 80.6%
- Precision: 65.7%
- Recall: 55.9%
- F1-Score: 60.4%
- Random Forest and XGBoost performed slightly lower.Tools and Technologies- Python, Pandas, NumPy, Scikit-learn, XGBoost, Logistic Regression, Random Forest, Gradio, Hugging Face Spaces, Jupyter Notebook, VS Code.

Amazon Scraping Project
Problem
- E-commerce platforms like Amazon contain a vast amount of valuable product information, but accessing it manually is time-consuming and inefficient.- Businesses, researchers, and consumers often need structured data (e.g., prices, ratings, and product details) for market analysis.Approach
- Developed a Flask-based web application that enables users to search for products on Amazon and automatically scrape relevant details.- Leveraged Selenium and BeautifulSoup to handle dynamic page content, extract structured information, and provide real-time progress updates on the web interface.- Processed scraped data using Pandas and enabled CSV export for further analysis.Results
- Successfully scrapes product details such as title, price, rating, brand, category, sales rank, image, and product URL.- Provides a modern, responsive interface.- Enables users to download structured datasets (CSV) instantly.Tools and Technologies- Flask, Selenium, BeautifulSoup, Pandas, Webdriver Manager, PowerBI

Movie Recommendation System Using MovieLens Dataset
Problem
- Users needed better personalized movie suggestions.Approach
- Built a hybrid recommender combining collaborative filtering with content-based embeddings; deployed as an interactive Hugging Face app.Results
- Improved recommendation relevance for test users; quick, interactive demo showcases predictions.Tools
- Python, Pandas, scikit-learn, Hugging Face Spaces, Streamlit.

Brain Tumor Detection using Deep Learning Models
Problem
- Detect and segment tumors in MRI scans for research purposes.Approach
- Implemented and compared five deep learning models (AlexNet, MobileNet, VGG16, LSTM, Inception) to classify brain tumors from MRI scans.
- Preprocessed images, trained models, and visualized performance metrics to identify the most accurate classifier.Results
- Experimental results demonstrated that VGG16 model achieved the highest accuracy of 0.97, followed by AlexNet with an accuracy of 0.96.Tools
Python, TensorFlow, OpenCV, pandas, matplotlib, Google Colab.
2048 Game
Problem
- Implement classic 2048 game logic with a functional UI and efficient merging mechanics.Approach
- Programmed game logic, handled tile merging and score calculation, created user interface.Results
- Fully playable game with stable functionality.Tools
- C#, .NET (WinForms), Visual Studio.
Email:
[email protected]
Phone:
+971 50 871 7605