Automatic Langugage Identification in Text






This Automatic Language Identification(LID) system is based on Neural Network.
The baseline model has achieved 81.7% & 97% accuracy over 100 and 1K characters
length texts.
Hence, the more the length of the sentence, the better the result.

The language identification model is currently limited to ten popular languages -
Bengali, English, French, German, Hindi, Polish, Russian, Tamil, Telugu, Turkish.

Some examples of different languages:

Bengali: আজ আমরা মেশিন লার্নিং ব্যবহার করে একটি স্বয়ংক্রিয় ভাষা শনাক্তকরণ সিস্টেম প্রদর্শন করছি।
English: Today we are demonstrating an automatic language identification system using machine learning.
French: Comment ça va aujourd'hui?
German: Heute demonstrieren wir ein automatisches Spracherkennungssystem mit maschinellem Lernen.
Hindi: आज हम मशीन लर्निंग का उपयोग करके एक स्वचालित भाषा पहचान प्रणाली का प्रदर्शन कर रहे हैं।
Polish: Dziś demonstrujemy automatyczny system identyfikacji języka za pomocą uczenia maszynowego.
Russian: Сегодня мы демтрируем автоматскую систе идентификации языка с помощью машинного.
Tamil: இயந்திர கற்றலைப் பயன்படுத்தி தானியங்கி மொழி அடையாள முறையை இன்று நாங்கள் நிரூபிக்கிறோம்.
Telugu: ఈ రోజు మనం యంత్ర అభ్యాసాన్ని ఉపయోగించి ఆటోమేటిక్ లాంగ్వేజ్ ఐడెంటిఫికేషన్ సిస్టమ్‌ను ప్రదర్శిస్తున్నాము.
Turkish: Bugün makine öğrenmesini kullanan bir otomatik dil tanımlama sistemi gösteriyoruz.

Please enter at-least 10 chracters

Detected Language: None