Eklavya Sarkar

MSc Data Science & BSc Computer Science Graduate

Currently a PhD student at EPFL and research assistant at Idiap Research Insitute,
working in the Speech and Audio Processing group, under Dr. Mathew Magimai Doss.

Funded by EvoLang project, and affiliated with TTF Tech.





Work Experience

Speech Processing

Research Assistant

Idiap Research Institute

Supervisor: Dr. Mathew Magimai Doss, Speech and Audio Processing Group


  • Funded by EvoLang Project, TTF Tech ASR.
  • Inference of Symbolic Structure given Minimal Supervision.
  • Low Resource Speech Processing.
March 2021 - Present



BiometricsMLDLGANs

Research Intern

Idiap Research Institute

Supervisor: Dr. Sébastien Marcel, HOD Biometrics Security and Privacy Group

  • Implemented different techniques to generate traditional and StyleGAN2-based face morphs.
  • Investigated vulnerabilities of modern facial recognition systems against morphing attacks.
  • Currently researching detection techniques for such attacks to publish paper by November.

May 2020 - Feb 2021
(10 months)



SEDSR&D

Intern

CERN

Project Manager: Dr. Archana Sharma, Principal Scientist, CMS Experiment

  • Refined efficiency of production code by implementing requested features on Python scripts.
  • Improved code used for testing detector in a QC stand by adding an step-size feature.
  • Created method for configuring detector’s electrical state with custom values.
  • Published real time gas levels of a mixer by writing code to send data to a server via an API.

July 2017 - September 2017
(3 months)



Thesis

DLML160 Pages

Facial Information Extraction

MSc, Computer Vision, Convolutional Neural Networks
  • Attempted to use state-of-the-art deep learning techniques to build models which take an image as input.
  • Performed facial detection, recognition, and emotion classification on the present individuals on the images.
  • Achieved 95% test accuracy on facial recognition with convolutional neural networks and hyper-parameter tuning.
  • Built separate models for tasks such as emotion classification before combining them into an end-to-end models.
  • Optimised performance with DL best practices: data augmentation, batch-normalisation, cross-validation.
2018-19
Grade: Distinction

SEML200 Pages

Kohonen Self-Organising Maps

BSc, Computer Vision, Pattern Recognition
  • Implemented unsupervised machine learning neural network from scratch without using any specific ML library.
  • Trained back-end model on 3 different open-source datasets to test neural network’s efficiency and scalability.
  • Developed front-end GUI for interactive data visualisation before & after clustering and dimensionality reduction.
  • Wrote extensive thesis covering all aspects of project such as system design, algorithmic optimisation, scalability.
2017-18
Grade: 90%

DS50 Pages

Exoplanets: Discoveries and Prospects

Research, Data Analysis, Literature Review
  • 2019 Update: Dider Queloz has since won the Physics Nobel Prize !
  • Conducted literature review on Exoplanets, with inputs from Didier Queloz, co-discoverer of the first exoplanet.
  • Showed correlations between possibly habitable planets and core laws of physics by analyzing open-source DB.
  • 50 page report selected among top 2013 student scientific projects in Geneva canton and Pays de Gex.
  • Invited to present project at a public ‘Science Sharing’ event at CERN's Universe de Particules museum.
2012-13
Grade: 6/6

Talks

DLDiarizationNCCR

Automatic Speech Segmentation

14 June 2021
HMM Tutorial Problems

Hidden Markov Models

8 Dec 2020
DLStyleGAN2Morphing AttacksVulnerability Analysis

Vulnerability Analysis of Face Morphing Attacks from Landmarks and GANs

4 Nov 2020
DLGANsVAEs

Generative Adversarial Networks

30 April 2020
DLCNNs

Convolutional Neural Networks

30 April 2020

Publications

Pre-printStyleGAN2Face RecognitionBiometrics

Face Morphing

Vulnerability Analysis of Face Morphing Attacks from Landmarks and Generative Adversarial Networks
Morphing attacks is a threat to biometric systems where the biometric reference in an identity document can be altered. This form of attack presents an important issue in applications relying on identity documents such as border security or access control. Research in face morphing attack detection is developing rapidly, however very few datasets with several forms of attacks are publicly available. This paper bridges this gap by providing a new dataset with four different types of morphing attacks, based on OpenCV, FaceMorpher, WebMorph and a generative adversarial network (StyleGAN), generated with original face images from three public face datasets. We also conduct extensive experiments to assess the vulnerability of the state-of-the-art face recognition systems, notably FaceNet, VGG-Face, and ArcFace. The experiments demonstrate that VGG-Face, while being less accurate face recognition system compared to FaceNet, is also less vulnerable to morphing attacks. Also, we observed that naıve morphs generated with a StyleGAN do not pose a significant threat.
October 2020

Projects

RLDLDQNDDPG

Deep Reinforcement Learning: Flappy Bird

Deep Q-Learning Network, Deep Deterministic Policy Gradient, Experience Replay

Attempted to a develop model which is able to learn to play Flappy Bird, and surpass human level scores by using Reinforcement Learning techniques. Specifically investigated Deep Q-Learning networks to develop an overview of the problem and deeper understanding on reinforcement learning techniques. Wished to showcase how computer vision and deep neural networks such as convolutional neural networks can be used in the context of reinforcement learning as well.

2019

NLPMLDL

Kaggle Competition: Toxic Comment Classification

Multi-Label Classification Problem

Attempted to solve a Kaggle competition in a group of three to the best of our abilities. Specifically strove for implementations beyond the exsiting classical ones, and attempted to develop a model which is well-adapted and fine tuned to the specific problem at hand. Implemented a Naive-Bayes Bag of Words model, Random Forest, Extra Trees, and compared their results with the Log Regression, Convolutional Neural Network, and Long Short-Term Memory Recurrent models.

2019

MLBayesianStats

Bayesian Machine Learning

Hamiltonian Monte Carlo Stochastic Methods, Automatic Relevance Determination

Used Bayesian modelling methods, specifically Hamiltonian Monte Carlo, to approximate Gaussian posterior distributions on a multivariate regression task to derive a good predictor from the dataset, and estimate which of the input variabels are relevant for prediction.

2019

NLPML

Open Information Extraction

Speech Tagging, Named Entity Recognition, Relation Extraction, Kitchen Sink

Attempted to summarise Jules Verne's 20,000 leagues under the seas' by training a classifier that indicates which of the part of speech tags each word is. The approach was based on Identifying Relations for Open Information Extraction (Fader, Soderland & Etzioni). To this end, Glove word vectors were employed to implement a logistic one vs all kitchen sink model, and attempted speech tagging on word and sentence levels, named entity resolution and relation extraction.

2019

SE

Robotics I

Localisation, Pathfinding, Navigation, Calibration, Object Detection

Wrote a program using the Java LeJOS framework that enables a robot to explore the arena which contains a small number of obstacles, placed at random locations. There was a single coloured sheet of paper which the robots had to be able to detect using the colour sensor which also signifed the end location, to which the robot had optimally navigate back to the ending position.

2017

SE

Robotics II

Scout, Doctor, Agents, Jason

Wrote a program using the Java LeJOS framework allowing a robot to determine it's starting location in the arena, and optimally work its way to the pre-determined ending position using scout and doctor agents while avoiding the possible obstacles.

2017

AndroidSE

Android Food App

Full stack development

Scran is a user-oriented application that aids in the decision-making process when choosing a restaurant, and more specifically a dish. Scran will maintain, search and track user and restaurant data to help its users to choose the dish they didn’t know they wanted.

2017

SE

Moving Average Filter

Generate, Filter and Display data

Wrote C++ in Xcode to generate random plot and noise values of a sinusoidal function using signal characteristics as parameters, which would then be handled by the designed event driven panels and data structures in LabVIEW, and subsequently transferred to Matlab to be displayed in both filtered and unfiltered states.

2014

Competitions

3rd PrizeDL

International Create Challenge

Adversarial Attacks

  • Developed model to detect and combat adversarial attacks using Foolbox toolkit.
  • Implemented website to evaluate the robutness of a given model to adversarial attacks using a specific metric.
  • Awarded 3rd place in overall ICC2020.
2020

SEiOS

Facebook Hackathon 2015

iOS Revision App

Developed iOS app with first generation Swift on xCode.

  • Goal was to give students a platform to revise and prepare for exams on the go.
  • Content specifically tailored to the common first-year Bachelor course.
  • Option of adding content for additional modules and courses by users.
  • Hopefully improve the student pass rate at EPFL by providing feedback and tips.

2015

News

CERN Intern

Featured on University of Liverpool Student News

FULL Student feature: My summer Internship at CERN

"Arriving wide-eyed at the main lab on the first day, I discovered that I was among twenty other excited students, from all over the world, ranging from Thailand, Brazil, France, US, India, Italy and many others, all of whom had arrived at different moments during the summer, meaning there was little time for individual introductions to the lab and explanations of the various hardware components and the software code base."

October 2017

Exoplanet Project Presentation

Featured on Colloc Transfrontalier TPE-TM

Selected to present on stage my research project on Exoplanets at the Colloque Transfrontalier: La Science en Partage (a public ‘Science Sharing’ event) at CERN’s Universe of Particles museum.

October 2013

Education

Ecole Polytechnique Fédérale de Lausanne

PhD Machine Learning
- Deep Learning

March 2021 - Present

University of Bath

MSc Data Science
- Statistics
- Machine Learning I & II
- Neural Computing
- Bayesian Machine Learning
- Reinforcement Learning
- Applied Data Science
- Software technologies for data science
October 2018 - September 2019
Grade: First Class

University of Liverpool

BSc Computer Science
- Efficient Sequential Algorithms, Complexity of Algorithms
- Robotics and Autonomous Systems, Multi-System Agents
- Biocomputation, Artificial Intelligence
- Complex Information and Social Networks
- Software Engineering, Group Software Project
- Automata Theory
September 2015 - June 2018
Grade: First Class

Ecole Polytechnique Fédérale de Lausanne

BSc Mechanical Engineering
- Calculus I & II
- Linear Algebra
- General Physics I & II
- Material Science
- Mechanics of Structure, Mechanical Conception

September 2013 - June 2015
(Uncompleted)

Skills

Languages and Libraries
  • Bash
  • Tools and Technologies

    Extra-Curricular

    President

    Student Residence Hall's Committee
    • Elected President of Hall Committee by ballot vote majority to represent 270 students.
    • Enhanced residents’ experience by taking charge and managing events throughout the year.
    • Responsible for formulating outline and implementation of vision for Hall’s community and life.
    • Led 10 member committee through generating team vision and chairing weekly meetings.
    • Maintained professional relationship with residence staff, guild of students, accommodation office and the university.

    2017-18

    Interests