We are running experiments to study how dataset composition (datacomp) affects LLMs.