Mansur Can

About

Having worked on various projects, I am always looking for the next challenge.

Big Data Engineer & Machine Learning Enthusiast

I work on the Common Data Model (CDM) project, where I specialize in mapping diverse data sources onto the CDM. I use Azure Synapse Studio and Azure Data Lake to orchestrate this transformation, with a focus on precision and efficiency at every step. As an advocate for structured, standardized data, I aim to bridge the gap between raw data and the enriched possibilities within Dataverse.

In a previous engagement, I played a key part in a collaborative migration project in which we transferred data from an RDBMS to HDFS within the Cloudera Data Platform using NiFi, bridging on-premises and cloud environments. In particular, I led the design of a Spark streaming architecture that captured real-time data, obtained by web scraping, from Kafka via Flask. This data was stored in HDFS and integrated with Hive for analysis, and the resulting insights were visualized in Superset, connected to our Hive repository. This work shows my ability to orchestrate data transformation across different platforms and deliver actionable insights.

  • Freelance: Available
  • Remote: Available
  • Location: London, UK

Skills

I have developed a wide range of skills and abilities through my studies and work.

Operating Systems: macOS, Windows, UNIX/Linux
Distributions: Cloudera, AWS, Azure, GCP
Environment: Anaconda, Docker, GitHub, GitLab, Databricks
Database: PostgreSQL, DynamoDB, MongoDB, AWS (RDS, S3)
Scripting: Python, PySpark, Scala, Java, R, MATLAB, SQL, HiveQL, HTML
Data Processing: Apache Spark, Spark Streaming, Pandas, NumPy
Web & App Servers: Google Firebase, Flask, Selenium, HTML, CSS, JavaScript
Testing Tools: unittest, pytest
Cloud Services: Amazon AWS (EC2, S3, RDS, EMR, DynamoDB, Redshift, CloudFormation); Microsoft Azure (Synapse, Data Factory, Blob, Data Lake, Databricks, DevOps, Power BI, Power Apps); GCP
Hadoop Ecosystem: Hadoop, Cloudera, HDFS, Hive, Kafka, HBase, Cassandra, ZooKeeper, Sqoop, Spark, Spark performance tuning/optimization, YARN, NiFi, Oozie, Impala

Resume

I have a strong background in research and have worked on a wide variety of projects.

Education

Specialist AI and Data School Program

AI Core

Certified in the practical application of AI & Data Engineering using industry-standard tools including:

  • Computer Vision Project
    Trained a computer vision model with TensorFlow to detect, in real time and with high accuracy, whether Rock, Paper or Scissors is shown to the camera. Used the OpenCV library to access the webcam and play Rock Paper Scissors against the computer using the camera image.
  • Data Collection Pipeline Project
    Developed a module that scraped data from various sources using Selenium. Curated a database of the scraped information and stored it on AWS RDS using SQLAlchemy and PostgreSQL. Performed unit testing and integration testing on the application to ensure that the package published to PyPI works as expected. Used Docker to containerize the application and deployed it to an EC2 instance. Set up a CI/CD pipeline using GitHub Actions to push a new Docker image.

PhD in Physics

Cardiff University, UK

  • During the year and a half of my PhD project, I concentrated on nanoparticle characterization through a combination of optical microscopy, image analysis, and simulation. I used COMSOL and MATLAB to model and simulate nanoparticle behavior and interactions at the nanoscale, which improved the depth and precision of the analysis.

MSc in Software Engineering

University of Westminster, UK

Postgraduate Diploma in Biomedical Engineering

Brunel University, UK

  • Project: 3-electrode graphene-on-paper biosensors for DNA testing
  • Software used in the projects: CFD, Abaqus, MATLAB

PGCE (Secondary) Science

Goldsmiths, University of London, UK

Postgraduate Certificate in Nanotechnology

University of Oxford, UK

BSc in Physics

Karadeniz Technical University, Turkey

  • I studied the Pascal programming language during my undergraduate degree.

Portfolio

These projects demonstrate my ability to solve problems, write clean and efficient code, and work in a team. They cover a diverse range of work that showcases my abilities and highlights my strengths as a software engineer.

  • Combined Four Countries
    PySpark
  • Facial Recognition iOS Application
    Swift and Python
  • Computer Vision
    OpenCV, Teachable Machine
  • Learning Model for Handwritten Digit Recognition
    Core ML, TensorFlow, Keras
  • Turkish Tutor Website
    Bootstrap, HTML, CSS
  • Webscraper
    Selenium, AWS, Docker
  • Portfolio Resume Website
    Bootstrap, HTML, CSS
  • Facebook Marketplace Search Ranking
    PyTorch, Kubernetes, Kubeflow, EKS

Library

A software engineering library can be extremely beneficial for professionals in the field, as it allows them to continuously learn and improve their skills. It can also help them stay up to date with the latest technologies and best practices in the industry.

Python

This is a basic cheatsheet for Python. It summarizes data types, loops, indexing, and dictionaries.
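
As a rough illustration of the topics listed above (not an excerpt from the cheatsheet itself), the short sketch below touches data types, indexing, loops, and dictionaries. All variable names and values are hypothetical.

```python
# Illustrative only: names and values are hypothetical, not taken from the cheatsheet.

# Data types
count = 3                            # int
ratio = 0.5                          # float
name = "data"                        # str
tools = ["Spark", "Hive", "Kafka"]   # list

# Indexing and slicing
first_tool = tools[0]    # first element
last_tool = tools[-1]    # last element
first_two = tools[:2]    # slice holding the first two elements

# Loops and dictionaries: map each tool name to its length
tool_lengths = {}
for tool in tools:
    tool_lengths[tool] = len(tool)
```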

R

These are study files from university that can help you study R at your own pace, week by week.

SQL

This document shows some basics of SQL, a language for talking to databases.

Git

Some basic Git commands for pushing code to a repository, such as init, add, commit, and push.

VS Code

Some useful VS Code keyboard shortcuts for Mac to speed up daily operations.

OOP in Python

A basic explanation of object-oriented programming in Python, presented in PowerPoint with a simple example.
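
For readers who want a feel for the topic before opening the slides, here is a minimal sketch of a class, inheritance, and method overriding. The `Shape` and `Rectangle` classes are hypothetical and are not taken from the presentation.

```python
# Hypothetical example classes; not taken from the presentation.

class Shape:
    """Base class holding a name and a common describe() method."""

    def __init__(self, name):
        self.name = name

    def area(self):
        # Subclasses are expected to override this method.
        raise NotImplementedError("Subclasses must implement area()")

    def describe(self):
        return f"{self.name} with area {self.area():.2f}"


class Rectangle(Shape):
    """Subclass that inherits describe() and overrides area()."""

    def __init__(self, width, height):
        super().__init__("Rectangle")
        self.width = width
        self.height = height

    def area(self):
        return self.width * self.height


rect = Rectangle(2, 3)
summary = rect.describe()
```

Here `Rectangle` reuses `describe()` from `Shape` and supplies only its own `area()`, which is the core idea of inheritance.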