After a gap of almost two years, I am happy to announce the second official release (version 0.2.0) of Anubadok a free (as in freedom) machine translation system for English to Bengali. Anubadok is written in Perl and it uses Penn Treebank annotation system for natural language processing. To run Anubadok 0.2.0, you need to have Part-of-Speech tagger GPoSTTL installed in your system. The Anubadok system can be accessed online using the interface Anubadok Online run by Ankur.
First official release (ver. 0.1) of Anubadok was an experimental release which mainly served as a proof-of-concept for an open-source English to Bengali machine translation system.
With the release of version 0.2.0, I am glad to upgrade its official tag from “an experimental software” to “a software under development” with clear-and-specific implementation targets. However given the nature of the project, there are no specific time-frames for future releases. Further, given machine translation is considered an open research topic in Computational Linguistic, you should expect to see some surprises
even for well implemented situations. Specially, if you are comparing results of machine translations with human translations.
In English, there are four types of sentences: Declarative, Imperative, Interrogative and Exclamatory. These sentence types further fall into four basic sentence type: Simple, Compound, Complex and Compound-Complex.
The table below gives approximate status of implementation for each sentence type in the current release and inversely it gives the targets for future implementations.
| Declar. | Imper. | Interro. | Exclam. | |
| Simple | W | W | W | M |
| Compound | M | M | M | M |
| Complex | N | N | N | N |
| Compound – Complex | N | N | N | N |
W: Well implemented
M: Moderately implemented
N: Not/Not-well implemented
Anubadok does not yet have any code to handle Complex or Compound-Complex sentences, not even moderately. This is where next push for development is needed.
Few other salient features of this release:
- The execution method of Anubadok system has been re-written. Anubadok itself has been implemented as Perl module. This means one can now access Anubadok in a Perl program directly by including Anubadok libraries (Perl modules) or in any other program by using appropriate Perl module wrapper.
- The notion of “testsuites” has been introduced for Anubadok. For a given English sentence, it compares a machine translated sentence with the expected Bengali sentence. This is quite an useful tool while adding new features or doing some experimentations as it would ensure that already implemented algorithm are not affected.
- Anubadok system can now handle several kinds of input documents including plain text files, any XML documents, HTML files with in-line javascript, CSS. Further, as earlier, it is capable of translating Portable Object (PO) files directly.
- Anubadok packaging has been completely reorganized to ensure that it has the basic structure of a standard Perl package. Consequently, Anubadok can be installed following the method of standard Perl module installation.
- Anubadok-0.2.0 comes with an updated dictionary having 15K+ entries in its database. This is almost double the number of entries it had in 0.1 release. Credit for this goes to all the contributors of Ankur English to Bengali dictionary project. Anubadok’s dictionary are now updated regularly using database dumps of Ankur E2B dictionary.
- Anubadok has now moved to its new website hosted by SourceForge.
http://anubadok.sourceforge.net
Latest source codes of Anubadok can be downloaded from the “trunk” branch of its SVN repository.
- Anubadok Online, the online interface to Anubadok system, has been upgraded substantially. It runs directly using SVN version of Anubadok engine. User contributed new entries though this interface are submitted automatically to Ankur E2B dictionary project.
- A brief document is now available for download as a PDF file from its website. It describes the internal working and the algorithm used by Anubadok system by considering specific example sentence.
Hi,
Golam Mortuza Hossain bhai, I am a Bengali wikipedian, lives in Kolkata. I frequiently use anubadok. Thank you for this great utility. Can I use this utility offline?
I have no knowledge about perl and Penn Treebank . I download source code. But I can’t nothing with this.
on download page is below statement………
http://anubadok.sourceforge.net/download.htm
After downloading the source code from SVN repository you can do the following to run anubadok:
* cd anubadok
* echo “I love you.” | ./bin/anubadok
I dint understand anything……
Please help me to do and what to with a step by step process….thanks in advance.
Regards,
Jayanta Nath
jayantanth@gmail.com
jayanta@hilandcal.com
Thanks for taking this huge step. When this will be completed it will be like a dream come true. Please let me know if I can contribute in any of your research and development. I am a voluntary (English-Bengali-English) translator in many sites.
@Rezwan,
You are always welcome to contribute to Anubadok project. As you can guess, Anubadok project needs multi-directional inputs equally from technical side as well as non-technical side.
If you have interests in coding then you can contribute new or improve its current algorithms.
The biggest obstacle for Anubadok project currently is the lack of a comprehensive Unicode-based English to Bengali dictionary database. It is unfortunate that we do not have such a free (open license) dictionary. So given your expertize, you might be able to help the project in this direction.
Also, you can always file bug-reports by giving the example sentence where Anubadok fails to translate correctly. You can file bug-reports by visiting this page
http://sourceforge.net/tracker/?func=add&group_id=210233&atid=1012912
Please feel free to let me know if you have any other suggestions.
Thanks,
Golam
Congratulations for the woderful work done with english-bangla dictionary and the software. I am very keen on creating glossary for english- bangla legal terminologies used in America and UK. I would be highly greatful to you is you could send me some links for this .
thanks mm
@mmishra
Thanks for your message. I really don’t know whether there exists any such glossary, available for download.
Thanks for your message. I really don’t know whether there exists any such glossary, available for download.
Not correct Translation
I am playing guiter. I am playing football. I am playing drama.
আমি guiter চালাচ্ছি। আমি ফুটবল চালাচ্ছি। আমি drama চালাচ্ছি।
I have a lot of problem of safety about the office cash
I want to properly translation. Please help this problem
Md. Masudur Rahman
e-mail: masudsaad@in.com
Assalamu alykum, i can’t see the bengali words on my computer, i cant read any bengali news papers from wrbsite… even whenever i use anubadak some sign comes like THIS (|||| ) please advise me how can i see the bengali fonts on webpages…i have installed bijoy 2006 yet i can see any bengali fornts on my computer….
wassalam
@taliba,
You need to install at least one Unicode font in your system. Bijoy is not a Unicode compliant standard font. You can get Unicode fonts from Ekushey
http://ekushey.org/?page/otf_bangla_fonts
Best,
So far Good Job. Appreciate it. Thank to those who had their hard work behind this.
very helpful site
thz…….
thanks a lot, well done, congratulations
Onubadoker uddokta o kormokortaderke ovinondon.
N.B. Jhenidah er prothom bangla website http://GraphicsWorldBD.webs.com
মুতুজা ভাই,
আমার সশ্রদ্ধ সালাম নিবেন। অনুবাদক এর অফিসিয়াল সাইট ও আপনার এই সাইটটা ইংরেজীর পাশাপাশী বাঙলায় পড়া ও জানার সুযোগ করে দেওয়া হলে অনেকেই অনুবাদক সম্পর্কে সহজে জানতে পারতো ও নুবাদক ডেভলপমেন্ট এর সহায়তায় এগিয়ে আসতে পারতে। আশাকরি বিষয়টি গুরুহ্বের সথে বিবেচনা করবেন।
অনুবাদকের ডাটাবেজএ শব্দ এন্ট্রি করতে পারছি না। আমি অভ্র কি বোর্ড ব্যাবহার করি। অভ্র কি বোর্ড দিয়ে ডাটা এন্ট্রি করতে পারছিনা। কোন কি বোর্ড ব্যাবহার করলে ডাটা এন্ট্রি করাযাবে আশাকরি জানাবেন।
ভালো থাকুন।
Dear Sirs, thank you for this Bengla language sight.
In past 2 years I have used it to assist me with sending short Msg’s in Bengla to my 5 year old, little girl, I support through World Vision.
I have developed a most loving rapport with Jinuk, who is now 7 years old. She is the brightest child I have known, including my own family! Eager to learn and to grow her list of knowlegeable ideas.
This small effort on my part to send a message she can read herself has contributed to this bonding process.
Today, I typed my message, but the usual Bengla script did not appear, only blocks. The request for translation was there, as before but no message. Please advise me what I must do to activate this wondeful service so I may continue to reach out to her. English translations that are sent me from her letters usually contain three lines or less, which hardly helps us to explore each others lives and our entire Families.
If there is a nominal charge or a way to set this up, how willing I will be to learn etc. Yours truly V. Davidson.
PS Are you aware of a contributor from Barrie, Ont, Canda, who would know Bengla, not Bangladesh (I unwittingly upset a young woam from Bangladesh, who thought I was supporting this child in a ‘private’ school as oposed to a ‘mission’ school). Letters then would actually be read, which young lady could not do.
My blessings and most grateful appreciation to all.
Valerie Davidson
Please contact me at my e-mail address re help on joining your wonderful language sight as I need help re using.
Thank you Valerie Davidson v.davidson@rogers.com
good job
Realy its nice..thanx..
Something is better than nothing.
অনুবাদক সাইট এ কিভাবে নতুন শব্দ এড করা যায়, এ বিষয়ে সংশ্লিষ্ট সাইটেই একটি দিক নির্দেশনা দেয়া থাকলে এবং নতুন শব্দ এড করার ব্যবস্থা থাকলে সাইটটি অনেক দ্রুত পরিপূর্ণতা লাভ করতো। আশা করি কর্তৃপক্ষ ব্যাপারটা খেয়াল করবেন এবং দ্রুত ব্যবস্থা নিবেন। সর্বোপরি সংশ্লিষ্ট সবাই কে অনেক ধন্যবাদ।
A great job done for us “Anubadok Online “. Thanks a lot. It is a great job for our nation. May all developer of “Anubadok Online ” live long.
URL: http://bengalinux.sourceforge.net/cgi-bin/anubadok/index.pl
ভাই গোলাম মোরতুজা হোসেন,
‘কম্পিউটারে বাংলার’ ক্ষেত্রে আপনার নিবেদিত প্রান অবদান চির স্মরনীয় হয়ে থাববে। সে জন্য প্রথমে আপনাকে আমার আন্তরিক শুভেচ্ছা এবং অভিনন্দন।
যে সব মানুষের র্দুলভ কীর্তি মানবতার কল্যানে সার্বজনীন হয়ে যায় তারা নি:সন্দেহে মহামানব। আর যারা তাদের জীবদশায় ব্যক্তিগত লাভ লোকসানের উর্দ্ধে উঠে সে কাজটি করে তাদের আরও বেশী কিছু বলা উচিত।
আবারও আপনাকে আমার আন্তরিক অভিনন্দন এবং ঈদের শুভেচ্ছা।
Arif Khan
arifuk150@gmail.com
Dallas, USA
Darun! Apnake dhonnobad
http://directorybd.blogspot.com/
i love u.
I want english sentence to bangla mening convert soft
English to Bengali Machine Translation System Anubadok is now 0.2
Dear Sir,
I want English to Bengali Machine Translation System soft
Thank you for your site, the program is helpful, it has increased my thirst practicing Bangla script.
বাংলা হতে ইংরেজী অনুবাদের সফটওয়ার বা ওয়েব সাইট থাকলে জানাবেন।
আপনাকে অভিনন্দন
অভিনন্দন গ্রহণ করুন।