Introduction We have seen an exponential growth in the volume of users on Online Social Networks (OSN) in the Indian subcontinent over the past few years. This has prompted the attention of several stakeholders and first responders to turn to OSN to make decisions and plan their next move. In multilingual societies such as India, it's ubiquitous to find large volumes of code-mixed online discourses, tweets, and posts. The process of performing Language and Text Analysis on any such data is not a trivial task as all of the traditional tools for NLP are based on English and do not work well for code-mixed data. In this blog post, I would be specifically talking about Hindi-English (Hi-En) Code-Mixed data (however most of the concepts apply to other forms of code-mixing as well.) What is Code-Mixing? Code Mixing is a natural phenomenon of embedding linguistic units such as phrases, words or morphemes of one language into an utterance of another (Muysken, 2000; Duran, 199...