My name is Rubén, and I am a Spanish data scientist with a background in Physics. I consider myself a creative, empathetic, and open-minded person. Maths, science, technology, and cinema are my passions. My friends often joke that I'm constantly questioning the why behind things.
I worked with a small team on the development of an NLP-based classification model (Python) that assigns a tax code to a product. We industrialized the solution using Azure Data Factory. The model reached an overall accuracy of 83%.
Development of an ELT process in Snowflake using PySpark, SQL, and stored procedures. Achieved full automation of the entire process, from new data arrival in Azure Blob Storage to the gold layer in Snowflake.
I received training in SQL, dbt, data engineering, PySpark, and business.
I worked on developing a battery of SQL tests to validate financial data. Part of the work involved optimizing existing queries, and in some tests we achieved a 70% improvement in performance.
Private lessons in mathematics and physics for students from middle school to college. This work has taught me to explain hard concepts as simply as possible.
My first DL project: a neural network that predicts whether a Twitter account is left-wing or right-wing based on its last 100 tweets. [IN PROGRESS]
FilmAffinity doesn't show on which platforms you can watch the movies in a list. I have implemented a solution to this problem using Python and BeautifulSoup4.
First deep learning project I built on my own. I implemented a CNN using the MNIST dataset.
When I learned about the Zebra Puzzle, I immediately thought of programming a solution to it. The program solves it the "smart way", not by trial and error. It obtains a solution in 9 iterations (execution time on my laptop: 0.00085 s).
Intermediate level. I'm currently preparing for the Cambridge B2 exam on my own.
I feel confident working with basic data structures, OOP, functional programming, NumPy, pandas, pip, and Jupyter notebooks.
Advanced level in writing SQL queries.
Advanced knowledge of maths (algebra, calculus, and statistics). I'm good at solving hard problems.
Advanced knowledge of physics.
I have worked mostly with scikit-learn, TensorFlow, and Keras.
I would describe myself as creative, assertive, and open-minded.
I'm familiar with the Linux environment and the Linux console. Pop!_OS user.