Available for Opportunities

Hi, I'm Dinuja Perera

Machine Learning Engineer

NLP & Predictive ModellingData Science

Machine Learning Engineer with experience building data-driven solutions for real-world industrial and business problems. I am particularly looking for opportunities where models are developed, deployed, and maintained in production environments.

Match my experience to your role

Paste your job description and I'll highlight the most relevant skills, projects, and research instantly.

United KingdomTier 1 Global Talent VisaFull UK Work Rights | Not Required Sponsorship
AI and Machine Learning
Experience2+ Years
Core FocusMachine Learning / AI
ResearchNLP, Deep Learning & Time Series Forecasting
About Me

Transforming Data into Reliable Intelligence

Machine Learning Engineer with a passion for building ML systems
that solve real-world problems

Current Work

Currently developing machine learning models to predict gas turbine trips using time-series modelling and survival analysis techniques.

UK Global Talent

Endorsed by UK Research and Innovation (UKRI)

ML Specialist

Generative AI, LSTM, RNN, and NLP expert

Collaborative

Cross-functional team experience

UK Work Rights

Full UK work rights | Sponsorship not required

Impact Driven

Real-world AI solutions for industry

What I Bring

  • Python & ML Development
  • Deep Learning (LSTM, ESN, RNN)
  • Azure Databricks & Scalable Computing
  • NLP & Transformer Models
  • SQL & Database Management
  • Model Evaluation & Explainability (SHAP/LIME)
  • Accelerated Computing (GPU/TPU)
  • Data Engineering & Pipelines
Experience

Professional Journey

Building ML solutions across industries and academia

AI Residency

Apziva

Feb 2026 - PresentUnited Kingdom
  • Developed machine learning solutions across NLP, computer vision, and predictive modelling projects
  • Built a candidate ranking system using NLP and similarity-based learning approaches
  • Designed a term deposit subscription prediction model with customer segmentation using K-Means
  • Applied feature selection and model evaluation techniques to improve model performance

Machine Learning Engineer (Predictive Maintenance)

Uniper UK | Loughborough University

June 2024 - PresentUnited Kingdom
  • Developed LSTM and Echo State Network models on turbine sensor data to predict system behaviour and identify early signs of faults
  • Analysed correlations and autocorrelation patterns to detect faulty instruments and understand sensor behaviour
  • Applied clustering techniques to group sensor patterns and improve feature engineering
  • Processed large-scale time-series data using Azure Databricks with GPU acceleration
  • Built visualisations and dashboards to communicate insights to engineers and managers
  • Worked closely with engineers and stakeholders to translate model outputs into practical operational insights

Machine Learning Engineer

CML Insight Inc., Texas, USA

Nov 2021 – Jan 2023Remote
  • Built and maintained machine learning pipelines for customer behaviour analysis
  • Developed NLP-based features including embeddings and sentiment analysis
  • Improved model reliability through feature importance analysis and data validation
  • Supported team members and interns in developing machine learning workflows

Teaching Assistant

Department of Information Systems Engineering, University of Colombo

Jun 2019 - Nov 2021Sri Lanka
  • Conducted practical sessions and supervised projects for master's and undergraduate courses, including Machine Learning and Neural Computing, Data Analytics, and Embedded Systems
  • Co-ordinated grading assignments, conducting code reviews, and invigilating exams for undergraduate and master's level programmes

Data Analyst

Brandix Apparel Limited

Jun 2019Sri Lanka
  • Analysed spare parts datasets using Microsoft Excel pivot tables to ensure data accuracy
  • Collaborated with cross-functional teams to identify and rectify data quality issues
  • Conducted assessments of discrepancies and inaccuracies to enhance overall data integrity

Engineering Trainee

Airport & Aviation Services Sri Lanka

Aug 2018 - Jan 2019Sri Lanka
  • Familiarised with navigation security surveillance communication systems
  • Participated in preventive maintenance activities to ensure optimal performance
  • Assigned to the Department of Air Navigation and Engineering to enhance operational efficiency
Technical Skills

Tech Stack & Expertise

A comprehensive toolkit for building intelligent systems

Languages & Frameworks

Python logo

Python

TensorFlow logo

TensorFlow

Keras logo

Keras

PyTorch logo

PyTorch

Scikit-learn logo

Scikit-learn

PySpark logo

PySpark

Pandas logo

Pandas

NumPy logo

NumPy

Cloud & Infrastructure

Azure Databricks logo

Azure Databricks

Google Colab logo

Google Colab

AWS S3

AWS Hadoop logo

AWS Hadoop

AWS Spark logo

AWS Spark

Databases

MySQL logo

MySQL

DuckDB logo

DuckDB

MongoDB logo

MongoDB

Cassandra logo

Cassandra

Redis logo

Redis

Neo4j logo

Neo4j

Analytics & Visualisation

Power BI

Tableau

Matplotlib logo

Matplotlib

Seaborn logo

Seaborn

Excel Pivot Tables

R logo

R

PCA

ML Models & Techniques

LSTM

ESN

NLP

RNN

Isolation Forest

Autoencoders

Random Forest

SVM

K-Means

XGBoost logo

XGBoost

Generative AI

LangChain logo

LangChain

LLM

Collaboration

GitHub logo

GitHub

ClickUp logo

ClickUp

Confluence logo

Confluence

Languages & Frameworks

Python logo

Python

TensorFlow logo

TensorFlow

Keras logo

Keras

PyTorch logo

PyTorch

Scikit-learn logo

Scikit-learn

PySpark logo

PySpark

Pandas logo

Pandas

NumPy logo

NumPy

Cloud & Infrastructure

Azure Databricks logo

Azure Databricks

Google Colab logo

Google Colab

AWS S3

AWS Hadoop logo

AWS Hadoop

AWS Spark logo

AWS Spark

Databases

MySQL logo

MySQL

DuckDB logo

DuckDB

MongoDB logo

MongoDB

Cassandra logo

Cassandra

Redis logo

Redis

Neo4j logo

Neo4j

Analytics & Visualisation

Power BI

Tableau

Matplotlib logo

Matplotlib

Seaborn logo

Seaborn

Excel Pivot Tables

R logo

R

PCA

ML Models & Techniques

LSTM

ESN

NLP

RNN

Isolation Forest

Autoencoders

Random Forest

SVM

K-Means

XGBoost logo

XGBoost

Generative AI

LangChain logo

LangChain

LLM

Collaboration

GitHub logo

GitHub

ClickUp logo

ClickUp

Confluence logo

Confluence

Languages & Frameworks

Python logo

Python

TensorFlow logo

TensorFlow

Keras logo

Keras

PyTorch logo

PyTorch

Scikit-learn logo

Scikit-learn

PySpark logo

PySpark

Pandas logo

Pandas

NumPy logo

NumPy

Cloud & Infrastructure

Azure Databricks logo

Azure Databricks

Google Colab logo

Google Colab

AWS S3

AWS Hadoop logo

AWS Hadoop

AWS Spark logo

AWS Spark

Databases

MySQL logo

MySQL

DuckDB logo

DuckDB

MongoDB logo

MongoDB

Cassandra logo

Cassandra

Redis logo

Redis

Neo4j logo

Neo4j

Analytics & Visualisation

Power BI

Tableau

Matplotlib logo

Matplotlib

Seaborn logo

Seaborn

Excel Pivot Tables

R logo

R

PCA

ML Models & Techniques

LSTM

ESN

NLP

RNN

Isolation Forest

Autoencoders

Random Forest

SVM

K-Means

XGBoost logo

XGBoost

Generative AI

LangChain logo

LangChain

LLM

Collaboration

GitHub logo

GitHub

ClickUp logo

ClickUp

Confluence logo

Confluence

Research & Projects

Featured Research & Projects

Showcasing research, production ML systems, and experimentation

Transformer-Based Sentiment Analysis of Company Reviews
MSc Dissertation

Transformer-Based Sentiment Analysis of Company Reviews

MSc Dissertation - NLP & Interactive Web Application

Built a transformer-based NLP system to analyse employee reviews from 500 UK companies. Compared models including BERT, RoBERTa, and XLNet, achieving 76% accuracy. Integrated topic modelling and aspect-based sentiment analysis to extract key insights from text data.

NLPTransformersBERTXLNetPyTorchLDANMFAWSSentiment Analysis
Term Deposit Subscriber Profiling
Research

Term Deposit Subscriber Profiling

Customer Segmentation and Subscription Prediction

Developed a machine learning solution to identify customers likely to subscribe to term deposit products. Performed exploratory data analysis and feature engineering on customer and campaign data. Applied K-Means clustering to segment customers based on behaviour. Used DuckDB and SQL queries within Python to analyse structured datasets efficiently. Helped identify high-value customer groups and reduce unnecessary marketing effort.

PythonDuckDBK-MeansPandasOOPData AnalysisCustomer Segmentation
Unhappy Customers in Logistics Delivery
Research

Unhappy Customers in Logistics Delivery

Customer Dissatisfaction Prediction using Machine Learning

Built a machine learning solution to detect dissatisfied customers in logistics delivery services using operational and feedback data. Identified patterns indicating poor delivery experiences and predicted likely dissatisfaction. Used LazyPredict to benchmark multiple classification algorithms and Hyperopt to optimize hyperparameters for improved performance.

PythonMachine LearningLazyPredictHyperoptScikit-learnOOPPandasData Analysis
Predicting Road Accident Severity in the UK
Research

Predicting Road Accident Severity in the UK

Supervised Machine Learning Classification

Applied supervised machine learning to predict the severity of road accidents across the UK using 2019 public data. Evaluated Random Forest, SVM, Decision Tree, KNN, and Deep Neural Networks. The deep neural network achieved the highest accuracy of 80.65% in classifying accidents as 'Slight,' 'Serious,' or 'Fatal.'

Machine LearningDeep LearningRandom ForestSVMNeural NetworksPythonTensorFlowKerasPCA
Metal Part Lifespan Prediction and Defect Classification
Research

Metal Part Lifespan Prediction and Defect Classification

Regression & Defect Detection with ML

Investigated metal part manufacturing datasets to predict part lifespan and classify defects. Regression models (Linear, Lasso, Ridge, Random Forest) were compared, with Random Forest achieving 97% accuracy. For defect detection, both binary classifiers and CNNs were tested. K-Means clustering revealed distinct process parameter groups influencing part quality.

RegressionClassificationRandom ForestCNNK-MeansGridSearchCVTensorFlowScikit-learn
Employee Churn Risk Prediction and Behavioural Analytics
Research

Employee Churn Risk Prediction and Behavioural Analytics

Causal Machine Learning for Workforce Analytics

Developed causal machine learning models to understand employee turnover behaviours. Used Random Forest and statistical feature importance techniques to identify the most influential drivers of churn. Performed feature leakage detection, refined model input space, and enhanced model generalisation. Integrated email sentiment analysis as an additional behavioural signal. Work carried out in a Linux-based environment using object-oriented Python.

Machine LearningRandom ForestSentiment AnalysisPythonScikit-learnFeature EngineeringLinuxOOP
Comparative Analysis of BART and RoBERTa for Hate Speech Detection
Research

Comparative Analysis of BART and RoBERTa for Hate Speech Detection

Published Research - WiNLP 2022

Explored transformer-based approaches for detecting hate speech on YouTube and Reddit using the ETHOS dataset. Compared BART and RoBERTa for binary and multi-class classification. BART achieved 70% F1-score and 58% top-1 accuracy, outperforming RoBERTa in distinguishing hate categories including gender, race, and religion.

NLPBARTRoBERTaTransformersHate Speech DetectionPyTorchClassification
Gas Turbine Sensor Fault Detection
Research

Gas Turbine Sensor Fault Detection

Time-Series Analysis for Predictive Maintenance

Developed machine learning approaches to analyse gas turbine sensor data and detect early signs of system faults. Used correlation and autocorrelation analysis to distinguish normal and faulty sensor behaviour. Applied time-series modelling techniques to support predictive maintenance strategies.

Time-Series AnalysisPredictive MaintenanceLSTMESNCorrelation AnalysisAzure DatabricksMachine Learning
Benchmark NLP Algorithm for Hate Speech Detection
Research

Benchmark NLP Algorithm for Hate Speech Detection

Deep Learning on Social Media

Tested 12 deep learning architectures, including RNNs, CNNs, transformer-based models (e.g., BERT, RoBERTa), and hybrid architectures (e.g., CNN + LSTM) to detect hate speech on social media platforms.

Deep LearningBERTRoBERTaSocial MediaCNNLSTMRNN

Published Research

"Short Comparative Analysis on Pretrained BART and RoBERTa in Detecting Hate Speech on YouTube and Reddit Platforms"

Presented at WiNLP Workshop co-located with EMNLP 2022

Education & Certifications

Academic Journey

Building a strong foundation in data science and machine learning through rigorous academic training

Tier 1 Global Talent Visa

Endorsed by UK Research and Innovation (UKRI)

Published Research

Presented at WiNLP Workshop co-located with EMNLP 2022

Multiple Distinctions

Achieved distinction grades in MSc Data Science and Graduate Diploma

Academic Timeline

Hover or tap a milestone to inspect the degree, institution, and key learning focus.

🎓

MSc Data Science

University of Greenwich, London

2023 – 2024
Distinction

Key Modules: Machine Learning, Applied Machine Learning, Data Visualisation, Statistical Methods for Time Series Analysis, Graph and Modern Databases, Big Data, Blockchain for FinTech Applications.

Completed projects involving comparative model evaluation (regression, classification, neural networks)
Advanced clustering analysis and optimisation of models for diverse datasets
Specialised in time series forecasting and big data processing

Continuous Learning

Professional Certifications

Recent certifications across machine learning, generative AI, statistics, industry simulations, and leadership development.

2

Industry

14

Technical

1

Leadership

Technical

Google Cloud

Introduction to Large Language Models

Skills you will gain

Large Language ModelingGoogle GeminiLLM ApplicationPrompt EngineeringGenerative AI
Show in Full
Technical

DeepLearning.AI

DeepLearning.AI TensorFlow Developer Specialization

Skills you will gain

Applied Machine LearningArtificial Neural NetworksClassification AlgorithmsComputer VisionConvolutional Neural NetworksData PreprocessingDeep LearningEmbeddingsForecastingGenerative AIImage AnalysisKeras (Neural Network Library)
Show in Full
Technical

DeepLearning.AI

Natural Language Processing in TensorFlow

Skills you will gain

Natural Language ProcessingRecurrent Neural Networks (RNNs)Artificial Neural NetworksText MiningEmbeddingsTensorflowMachine LearningGenerative AIData PreprocessingApplied Machine Learning
Show in Full
Technical

DeepLearning.AI

Introduction to TensorFlow for Artificial Intelligence, Machine Learning, and Deep Learning

Skills you will gain

Data PreprocessingTensorflowComputer VisionArtificial IntelligenceArtificial Neural NetworksImage AnalysisKeras (Neural Network Library)Convolutional Neural NetworksMachine LearningDeep LearningModel Evaluation
Show in Full
Technical

DeepLearning.AI

Convolutional Neural Networks in TensorFlow

Skills you will gain

Computer VisionImage AnalysisTensorflowDeep LearningModel EvaluationMachine LearningClassification AlgorithmsKeras (Neural Network Library)Data PreprocessingTransfer LearningApplied Machine LearningConvolutional Neural Networks
Show in Full
Technical

DeepLearning.AI

Sequences, Time Series and Prediction

Skills you will gain

Machine LearningForecastingArtificial Neural NetworksDeep LearningRecurrent Neural Networks (RNNs)Data PreprocessingApplied Machine LearningConvolutional Neural NetworksTime Series Analysis and ForecastingPredictive ModelingTensorflow
Show in Full
Technical

Eindhoven University of Technology

Improving your Statistical Inferences

Skills you will gain

Scientific MethodsStatistical Hypothesis TestingProbability & StatisticsR ProgrammingQuantitative ResearchStatistical InferenceSample Size DeterminationBayesian StatisticsData SharingResearchStatistical Analysis
Show in Full
Technical

Eindhoven University of Technology

Improving Your Statistical Questions

Skills you will gain

ExperimentationStatistical InferenceStatistical AnalysisResearch DesignData SynthesisSample Size DeterminationData SharingStatistical MethodsScience and ResearchResearchR ProgrammingQuantitative Research
Show in Full
Leadership

Ashorne Hill Management College

Leadership & Management Development Programme

Skills you will gain

Leadership & ManagementStakeholder CommunicationStrategic Decision MakingTeamwork & CollaborationNegotiationProject LeadershipApplied Research DeliveryIndustry Collaboration
Show in Full
Industry

Innovate UK

Knowledge Transfer Partnership Associate

Internal Listing
Industry

Forage

British Airways - Data Science Job Simulation

Show in Full
Technical

LinkedIn

Amazon Web Services Machine Learning Essential Training

Skills you will gain

Machine LearningArtificial Intelligence (AI)Amazon Web Services (AWS)
Show in Full
Technical

LinkedIn

Python Practice: Object-Oriented Programming

Skills you will gain

Python (Programming Language)Object-Oriented Programming (OOP)
Show in Full
Technical

LinkedIn Learning

Advanced NLP

Internal Listing
Technical

LinkedIn Learning

PySpark for Big Data

Internal Listing
Technical

LinkedIn Learning

Azure DevOps Fundamentals

Internal Listing
Technical

LinkedIn Learning

Generative AI Foundations

Internal Listing
Get In Touch

Let's Work Together

If you would like to discuss opportunities, ask a question, or collaborate, feel free to get in touch.

Ready to Connect?

I'm currently available for full-time positions in Machine Learning and AI.

✓ Full UK Work Rights✓ Open to Relocation

Looking for Collaboration?

I'm particularly interested in projects involving Applied Machine Learning, MLOps, Cloud Computing, NLP, and Time-Series Analysis.Let's discuss how we can work together to build innovative AI solutions.