::--ACTIVE RESEARCH PROJECTS--::
::DOCUMENT IMAGE PROCESSING::
Optical Character Recognition:
Optical Character Recognition (OCR) is the process of converting an image of printed text into editable text that can be reused and reformatted. We are developing a Bangla character recognizer that can recognize printed Bangla documents and convert them to editable text.
Project Page: http://www.bracuniversity.net/research/crblp/ocr
::SPEECH PROCESSING::
Imagine a day when you wake up, get washed and order your Personal Assistant (who most unfortunately is a machine) to make coffee and to read out the headlines from the newspaper.
Two basic hurdles stand in the way. First, how do we make a computer understand what we are saying? Second, how do we make the computer say the things we want it to say? The first problem calls for speech recognition technology, and the second for speech synthesis.
Emotion Recognition from speech:
This project aims to conduct research on emotion recognition from the speech of native Bangla speakers. At present we are targeting four emotional states: unemotional (normal), anger, happiness and sadness.
Project page: http://www.bracuniversity.net/research/crblp/emr
Speech Synthesis:
Speech synthesis is the artificial production of human speech. A text-to-speech system converts normal text to speech. We are working on generating speech signals from Bangla text.
Project page: http://crblp.bracu.ac.bd/demo/tts
Speech Corpus:
A speech corpus (or spoken corpus) is a database of speech audio files and text transcriptions in a format that can be used to create acoustic models for speech recognition and for unit selection speech synthesis. This speech corpus contains read speech and continuous speech.
Project page: http://crblp.bracu.ac.bd/demo/speech_corpus/
::COMPUTATIONAL LINGUISTICS::
Corpus Analysis & Corpus Collection:
We have developed a tool for extensive corpus analysis of word frequency distributions. Our corpus currently consists of one year of Prothom-Alo newspaper text, together with Charjapad and Baru Chandi Das Er Kabbo. We have analyzed the corpus for regularities and anomalies in Bangla word usage.
Project Page: http://www.bracuniversity.net/research/crblp/corpus
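To illustrate what a word frequency distribution over a Bangla corpus looks like, here is a minimal Python sketch. It is not the CRBLP tool. The function names, the sample sentence and the tokenization rule (a word is any run of characters from the Bengali Unicode block) are assumptions made for the example only.

```python
import re
from collections import Counter

# Assumption: a "word" is a run of characters from the Bengali Unicode block
# (U+0980 to U+09FF). The danda (U+0964) lies outside this range, so it acts
# as a separator. Bengali digits fall inside the range and are counted as
# words, and zero-width joiners inside a word would split it; a real
# tokenizer would need to handle both cases.
BANGLA_WORD = re.compile(r"[\u0980-\u09FF]+")


def word_frequencies(text: str) -> Counter:
    """Count how often each Bangla word form occurs in the text."""
    return Counter(BANGLA_WORD.findall(text))


def rank_frequency(counts: Counter, top_n: int = 10):
    """Return (rank, word, frequency) tuples for the most frequent words."""
    return [
        (rank, word, freq)
        for rank, (word, freq) in enumerate(counts.most_common(top_n), start=1)
    ]


# Hypothetical sample text, not taken from the project's corpus.
sample = "আমি ভাত খাই। তুমি ভাত খাও। আমি বই পড়ি।"
counts = word_frequencies(sample)

for rank, word, freq in rank_frequency(counts, top_n=3):
    print(rank, word, freq)
```

A rank-frequency table of this kind is a common starting point for spotting regularities (very frequent function words) and anomalies (unexpected spellings or rare forms) in a corpus.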
Localized URL:
Researchers around the world are devising and developing standards for localized domain names, so why should we not do the same? The CRBLP team is working on this project to develop a standard for domain names in Bangla.
Project Page: http://www.bracuniversity.net/research/crblp/localURL
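For background on what a localized domain name involves technically, the sketch below uses the `idna` codec from Python's standard library to convert a Bangla label to the ASCII form that the existing DNS can carry, and back again. It shows the general internationalized domain name mechanism only, not the standard this project proposes. The domain `বাংলা.example` and the helper names are hypothetical.

```python
def to_ascii_domain(domain: str) -> str:
    """Encode a Unicode domain name into its ASCII-compatible form."""
    # The built-in "idna" codec converts each non-ASCII label to an
    # "xn--" prefixed Punycode label and leaves ASCII labels unchanged.
    return domain.encode("idna").decode("ascii")


def to_unicode_domain(ascii_domain: str) -> str:
    """Decode an ASCII-compatible domain name back to Unicode."""
    return ascii_domain.encode("ascii").decode("idna")


# Hypothetical domain used purely for illustration.
bangla_domain = "বাংলা.example"
ascii_form = to_ascii_domain(bangla_domain)

print(ascii_form)
print(to_unicode_domain(ascii_form))
```

The user types and sees the Bangla form, while resolvers and registries exchange only the ASCII form.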
Lexicon:
Any kind of Bangla language processing needs a rich and informative lexicon. We have developed a wordlist of 160 thousand words with first-step part-of-speech tagging.
Project Page: http://www.bracuniversity.net/research/crblp/lexicon
Wordnet:
Any kind of Bangla language processing needs a rich and informative lexicon. We have developed a wordlist of 160 thousand words with first-step part-of-speech tagging.
Project Page: http://www.bracuniversity.net/research/crblp/wordnet
Parallel Corpus:
The aim of this project is to build a bilingual Bangla-English parallel corpus in electronic, aligned form. A parallel corpus can be used for translation memory, cross-language information retrieval, multi-lingual lexicography, terminology extraction, language study and as a language learning tool.
Project Page: |