Category: machine learning

  • We’re all AI Engineers now.

    On non-determinism in Software Engineering. This year we’ve seen the rise of the term “AI Engineer” in job openings. How does this differ from Machine Learning Engineering, which is a combination of Model Training, Inference Optimization and traditional distributed systems engineering skills? The AI Engineer role is meant to address the fact that calling AI…

  • A good craftsman never blames his tools

    There has been a tweet that has been in my thoughts a lot in the last.6 months – regarding AI agents. In the end, it boils down to “Agents are the Hello World of the AI Engineer.” At first, I wasn’t sure about this sentiment. General agents may be some ways off, narrow agents can…

  • 5/4 – Papers I’m Reading

    Every week the firehose of new preprints continues for AI. There are so many papers of late, that it is a constant running gag in the ML community about how much research there is to keep up with. My purpose in cultivating more of an intentional practice of reading and engaging with whatever new ideas…

  • Training 100 models in public

    One of the things that I’ve admired of individuals like Andrej Karpathy or Dwarkesh is that the work that they do is deeply public. Things like Mini GPT are both a way of advancing one’s own knowledge about the depth of craft with transformers, as well as a way of teaching and transmitting that knowledge…