Cleaning: from a production SQL database, we format the data in a (very) large denormalized table;
Featuring: we compute a lot (a lot) of features based on this table to create a large matrix;
Learning: we feed our homemade machine learning algorithm with this matrix and estimate a model.
You develop a lot of stuff in Spark and try different design to improve our performances using any kind of kick-ass technology, all of that using a slick git workflow;
You walk across the campus and cross some students, teachers and horses;
You can play with a cluster to get better performance (or just to learn). You can also use large machines to run your code;
You can use the sport facilities on campus during if you want;
You are also part of every decision regarding the evolution of our architecture (such as numerous NoSQL debates for instance);
You are also looking for new libraries, releases, papers, conferences, literally anything that’s linked to your field of interest. The campus library is fully stocked, and you can order what’s missing.
Curious is your middle name.
GitHub and StackOverflow are your favorite social networks.
You know why you’re using Eclipse instead of IntelliJ or the other way around.
You spent several years studying computer science or mathematics and more time playing with algorithms. You acquired good programming skills and know about clean code, tests and versioning.
You’ve seen things. Weird implementations and hairy publications. Everywhere.
You dream in Scala or Python (and you don’t talk about nightmares).
You’ve heard of functional programming, but you’re not supposed to be killer at it.
Also, you know about the machine learning ecosystem and everything revolving around distributed computing.
You can survive at least a month without water or food on Unix and you’ve picked your stand between emacs and vi.
A killer in Scala;
A hardcore Spark developer;
Fluent in python;
An experienced sysadmin.
Work in a highly challenging intellectual environment;
Wear sandals with socks (but why would you do that?);
Benefit from the campus infrastructure, which means that you could practice horse riding, golfing and water-polo, unfortunately not at the same time;
Use a lot of different technologies and try any that looks appealing to you;
Have access to adequate conferences, books and trainings if needed.