Scaling existing translation technology to all language-pairs in the world is not feasible due to the lack of aligned parallel corpora and other resources needed by statistical machine translation algorithms. This project seeks to combine all existing translation dictionaries present in the world into a single resource, translation graph and to perform probabilistic inference on this graph to automatically infer translations between language-pairs for which no dictionary exists.


We have compiled the largest translation dictionary, PanDictionary, that contains over 4 times the number of translations compared to the English Wiktionary.