I wasn’t expecting this to work so well! The one weakness seems to be sometimes with generic scripts (that are hard to identify, for example a custom Python script) Gemini says the application is the workload manager (e.g., Slurm). I think this is probably OK because there is enough good signal in there to get a labeled dataset for other things.