Ry Cutter

Logo

A PhD in Astrophysics turned Data Scientist and Software Engineer, driven by a passion for science communication and innovative problem-solving.

Multidisciplinary:
- Lead Innovation Data Scientist
- Customer Insights and Behaivour Analyst
- Machine Learning Engineer

Machine Learning, Embedded ML, Software/Firmware Engineering, Automation, Python, Time Series, Forecasting, NLP, Data Visualisation, Experimental Design, Physics, Statistics, Wearable Tech, Connectivity, Human Factors, UI/UX

View My LinkedIn Profile

My CV

View My GitHub Profile

LSST PLAsTiCC Astronomical Classification

This was my first encounter with data science! It was a kaggle challenge that had surprising overlap with my PhD. The goal was to identify signal sources in multi-colour phtometric and spectroscopic data in LSST (The Large Synoptic Survey Telescope.) The code is not great, but I was a young novice!

In essence, through data exploration, I was able to identify 2 clear groups of classes. Those whose redshift was 0 (intra-galatic sources) and those who were redshifted z>0 (extra-galatic). I was further able to see that most sources had spectroscopic variability. This meant I could create an extra feature using a multi-bandpass Lomb-Scargle.

For the Z=0 group I was able to get > 80% accuracy:

For the Z>0 group my accuracy was a little less great:

If I had more time I would have looked more deeply into classes 15,42, 52, 62, 67, and 90. The fact that they were the only ones where there was true confusion suggests they belong in the same super-class. The trebulations of a PhD means you don’t have much time to waste on fun challenges!

I believe for a first attempt at any machine learning my placement in the rankings was respectable.