Tag: machine learning

  • 5/4 – Papers I’m Reading

    Every week the firehose of new preprints continues for AI. There are so many papers of late, that it is a constant running gag in the ML community about how much research there is to keep up with. My purpose in cultivating more of an intentional practice of reading and engaging with whatever new ideas…

  • Training 100 models in public

    One of the things that I’ve admired of individuals like Andrej Karpathy or Dwarkesh is that the work that they do is deeply public. Things like Mini GPT are both a way of advancing one’s own knowledge about the depth of craft with transformers, as well as a way of teaching and transmitting that knowledge…