Quantifying Linguistic Variation Speedrunning my PhD thesis: What is a language, domain, or even a task? machine learning natural language processing phd