Fabricio Arend Torres

Machine Learning - PhD Student

About Me

Hi, I’m Fabricio, a Computer Scientist working in the area of Machine Learning, Data Science & Statistics. I’m currently a PhD Student at the University of Basel, where I’m researching ways to model different kinds of ornithological data. The applications range from modeling migration events based on radar data to estimating species distribution models of birds based on citizen-science observations.

On the methodological side my interest lie in Gaussian Processes, Density Estimation (including deep generative models and generative flows), and Implicit Representation Networks. Within the University I’m further involved in mentoring Bachelor’s and Master’s theses and assist in the ‘Scientific Computing’ lecture held at our department. Finally, I regularly review papers for some of the major ML Conferences (NeurIPS, ICML, ICLR).

In more practical terms I have worked quite a bit with the most common Machine Learning and Data Science frameworks, notable mentions being Tensorflow, Pytorch and libraries for scalable Gaussian Process (GPflow, GPytorch). Aside from that, I’m comfortable with the general scientific stack of libraries within python (numpy, scipy, pandas, scikit-learn, matplotlib). As the work within my projects includes extensive preprocessing and exploratory analysis, I have some experience working with spatiotemporal geo-data and the data formats common within that area. While there is a wide range of frameworks and languages I haven’t worked with, I always enjoy delving into new technologies and frameworks.

A more complete list of programming languages, frameworks and tools I’ve worked with is listed below. The level of familiarity is indicated with stars, ranging from basic concepts () to extensive experience (**).

  • Programming languages: Python (**), R (), Javascript (*)
  • Deep / Machine Learning: Tensorflow1 (*) & 2 (). Pytorch (), scikit-learn
  • Gaussian Processes: GPflow(**), GPytorch()
  • Exploratory Data Analysis & Preprocessing: Python Scientific Stack (**)
  • Data Extraction: pandas(), SQL(), Google Earth Engine()
  • Workflow: conda/virtenv/pip(), git(), linux(*), SLURM()

Projects

Estimating relative abundance of nocturnal migrant species

todo

todo

Scalable Gaussian Processes for spatio-temporal interpolation of migration events

todo

todo

Simulating Stochastic Paths for estimating path-level uncertainty

todo

todo

Experience

University of Basel

Student Assistant for Biomedical Data Analysis

Nov 2016 - Dec 2017

Assistance in a research project for analysing Next Generation Sequencing Data in context of modeling HIV treatment outcomes. Here I gathered first practical experiences with processing large scale data, aligning genome sequencing data, and general feature engineering.

Education

University of Basel

MSc Computer Science with focus on 'Machine Intelligence'

2016 - 2018

In context of my master’s thesis ‘Sampling and Annealing for Dependency Subnetwork Estimation’ I implemented and extended a model for estimating sparse dependency networks between variables. The model was applied to HIV data for detecting possible connections between resistance-relevant mutations and treatment outcome.

This project allowed me to get some insight into more classical Bayesian models as well as MCMC sampling schemes.

University of Basel

BSc Computer Science

2012 - 2016

todo

todo

A Little More About Me

todo