Your task is to implement an automatic language identification system that can identify… 1 answer below »

Your task is to implement an automatic language identification system that can identify 5 written languages (English, French, German, Italian and Spanish). The system will input a text and output the language identified for this text. Assume that all words in the input text are written in the same language. The system consists of two stages: Learning and Identification. Details of the system are as follows. Stage 1: Learning languages from given text files using the n-gram technique (here n = 2, bigram). The following steps are required: • Access to a given folder named Learning and verify that it contains 5 text files which are English.txt, French.txt, German.txt, Italian.txt, and Spanish.txt. These text files are in UTF-8 format. • Do the following for each of the 5 text files: o Open the current text file, read its content, change all uppercase letters to lowercase ones, and remove non-alphabetic characters such as ~ ` ! @ # $ % ^ & * ( )

"Is this question part of your assignment? We can help"

ORDER NOW