On the new year’s day, the Anandabazar Patrika, the largest Bengali newspaper from West Bengal, begins one of their editorial with the sentence — “সাম্প্রতিক পশ্চিমবঙ্গের জনপ্রিয়তম শব্দবন্ধ ‘পরিবর্তন চাই’” (“The most popular words in recent Bengal — ‘We want change'” ). The same editorial ends with the proposition — “নূতন বৎসরের মূলমন্ত্র হউক ‘পরিবর্তন চাই’” (“Let the mantra for new-year be — ‘we want change’ “). Leaving aside the politics, there is a serious need of change in technology adoption in West Bengal and that is to help its beloved language Bengali to survive in its digital avatar.
In their own words, the Anandabazar Patrika (ABP) may sound like a champion of change but in practice they are no different. Being a leader in Bengali publishing industry, one might expect them to be in forefront in improving the digital standard for Bengali. Unfortunately, their action speaks just the opposite. They continue to use non-standard, bitstream font technology in their website instead of using international standard, the Unicode. One of their “supported browser” is Netscape Communicator whose official support has ended in 2008. They also recommend the use of Firefox plugin Padma. Being the author of ABP support in Padma, this seems rather strange to me. They are asking users to convert their contents to Unicode (by using Padma) rather than serving their contents directly using Unicode.
It may be mentioned that like many other non-Latin languages, digital representation of Bengali texts suffered from a lack of encoding standard in its early phase. However with the advent of Unicode, the universal encoding standard, this is no longer an issue. The Unicode standard has been widely adopted across different operating systems and all recent versions of Windows, Mac or Linux support Unicode natively. According to a statistics from the internet giant Google, the Unicode is most frequently used encoding on the internet since 2008.
Nevertheless, there has been a significant increase in Unicode adoption also for Bengali in recent past. Let me mention some of them.
Bangladesh:
In may be noted that Bengali is the national language of Bangladesh and they too suffered from the same problem. However, there has been a dramatic increase in adoption of Unicode lately. The largest news paper from Bangladesh by circulation, the Prothom Alo, has now switched to Unicode. Until recently they were using their own proprietary encoding. Other prominent news papers that have switched to Unicode from proprietary encoding are Amar Desh, Sangbad, Daily Sangram, Manab Zamin, Samakal.
West Bengal:
The West Bengal government has now adopted Unicode 5.0 as the standard encoding for Bengali. Their official website Banglar Mukh has finally switched to Unicode. Furthermore, with their funding the entire literary work of Nobel laureate Rabindranath Tagore has been released using Unicode. Tagore’s works are now in public domain due to the expiration of copyrights. The credits for these encouraging developments must go to the Society for Natural Language Technology Research and the company behind some of these implementations, the MAT-3 Impex.
Coming back to the technology front, there is now a new kid in the great browser arena, the Google Chrome. This snappy browser while supports Unicode natively, currently uses a buggy font for Bengali by default. This causes some Bengali texts to appear garbled. Most of these issues can be solved by simply changing its default font. To do so click on
Wrench-->Options-->Under the Hood-->Change fonts and language settings
and then choose the font of your choice for Bengali. For example in Ubuntu you can choose Freesans or Freeserif. These fonts have nice glyphs for Bengali.
I’m reading this blog on Chrome 4.1 on Win 7 (unfortunately). The unicode text in Bengali in the blogpost renders properly on my browser even conjuncts. Is it possible that Chrome’s shortcomings in rendering stem from poor font management on the Linux version?
Well, it could be. Although it seems to affect some other Bengali fonts also. For example, please see the Chrome bug 20105. So there could be some other issues too.
প্রিয় গোলাম, আমি গুগল ক্রোমে, রেঞ্চ — চেঞ্জ ফন্ট অ্যান্ড ল্যাংগোয়েজ-এ গেলাম। সেখানে, তিনটে জায়গা দেখাচ্ছে, সেরিফ, সঁ-সেরিফ আর ফিক্সড-উইডথ ফন্টের। আমি সেরিফ ফন্টে ‘লোহিত বাংলা’ করলাম, অন্য দুটোয় কী করতে পারি? আমার ফেডোরা সিস্টেমে ওই ফ্রিসঁস আর ফ্রিসেরিফ নেই। অন্য কী করতে পারি? আর ডিফল্ট এনকোডিং নিশ্চয় ইউটিএফ-৮ রাখব?
You can set all three fonts to Lohit Bengali. However, it seems that “Freeserif” font works much better than others. Here is a screenshot (Using latest dev version of Chrome)
http://picasaweb.google.com/lh/photo/gimNjQFnZ-w5nWzUcTRggA?feat=directlink
Also, this font issue seems to affect only the Linux version and devs are aware of it. So hopefully, it will get fixed soon
ফায়ারফক্সে দিব্য কাজ করছে তোর ওই আনন্দবাজার পড়ার ইউনিকোড কলটা। আর গুগল ক্রোম চালালেই বাংলা ঝঞ্ঝাট তো আছেই, তার সঙ্গে মেশিনও উল্টোপাল্টা রকমে ঝুলে যাচ্ছে। কতদিন পরে যে হ্যাং হতে দেখলাম মেশিনে, মজাও লাগছিল। লিনাক্সে তো ওসবের গল্পই নেই প্রায়। তোর মেশিনের বাংলা ফন্টগুলো জিপ করে একটা পাঠিয়ে দে না। তারপর নয় ওই ফ্রিসঁসই দেব। লোহিত বাংলা তো সেরিফ ফন্ট। সেরিফহীন ফন্টে সেটা দেব কী করে? বা সমপ্রস্থ ফন্টও নয়। বাংলায় সমপ্রস্থ ফন্ট আদৌ হয় কি?
how you implemented Bengali language in your blog. I’m using the latest wordpress version 3 and trying to implement Bengali in my posts but whenever I write Bengali in wordpress and publish the post, everything turns into “????” If you guys know any solution to this, please let me know. Thanks!
Ami English to Bangali language translator ki bhabe pabo? janaben