Case Study – Multil ingual URLs
This is about a website which helps ordinary users access hard-to-type or
local-language Internet addresses (URLs) and doubles up as a quick URI
encoder/decoder for programmers.
The Audience
Any computer user who communicates in any language whose written form
utilises characters other than the plain 26 letters and 10 numerals in the
Latin (English) alphabet.
The Problem
You need to send someone, say our Arnab in Finland, a Bengali (or any
other local language) URL. But you're not sure how it will show up (???)
Usually anything in Bengali shows up as a series of the first consonant of the
Bengali language viz. “ ” in Arnab’s Macintosh computer.
The Solution
Go to http://unescape.hereweb.com/ and click the "Clear All" button. Enter
your Bengali (or other local language) URL in the white input box – plenty of
Bengali ones are obtainable from http://bn.wikipedia.org/ – and click the
"Encode" button at http://unescape.hereweb.com/. Now triple-click the text
labelled "Result" to automatically select it fully including the invisible tail end.
Copy the selected text and paste it in your email message or send it
however you wish to send it to Arnab - even by SMS. Now Arnab will be able
to access that page with the Bengali (or other language) URL. Whether he
can read the text will depend on many other factors and if he can't he'll at
least be able to experience the multimedia including images and most of all
he'll be able to preserve the URL or forward it to someone or access it from
a PC (as opposed to Mac) at some convenient time and place.
Example
Many social networking sites can’t handle local language text, right? Well
almost! The following mixed-language URL won't make sense to your
browser because the way its Bengali portion has been represented:
http://bn.wikipedia.org/wiki/প্রিয
492;ঙ\2509;কা_বঢরা
Now the "Encoded" (as discussed) version of the same URL pointing to the
same page is in the following line:
http://bn.wikipedia.org/wiki/%E0%A6%AA%E0%A7%8D%E0%A6%B0%E0
%A6%BF%E0%A6%AF%E0%A6%BC%E0%A6%99%E0%A7%8D%E0%A6
%95%E0%A6%BE_%E0%A6%AC%E0%A6%A2%E0%A6%B0%E0%A6%BE
Yes the encoded URL is a bit longer, but it works! Try it. By the way the
same URL as above looks like this in unencoded form:
http://bn.wikipedia.org/wiki/ _
For Programmers
You guys know this minisite is a very simple JavaScript application but it is very
handy when you need the encoded or decoded form of something quickly as in
the middle of writing a program. In fact if your input begins with “http” then it
will be automatically handled with decodeURI or encodeURI; else with
decodeURIComponent or encodeURIComponent.
Forwarding This
This message was created with the objective that it should reach every
computer user whose mother-tongue is not English. So a few words about how
1/2
to forward an email message without compromising the recipients’ privacy. In
the To field type undisclosed-recipients@example.com. Now click “Show Bcc”
(or something having pretty much the same implication) if the Bcc field isn’t
showing. Put all your recipients’ addresses in the Bcc field. This way your
recipients will know who the message came from but they won’t see each
others’ addresses. So you won’t be exposing tons of addresses to potential
spammers by forwarding this message. Finally check that there are no
unnecessary email addresses in the body of the message you are forwarding
and that any attachments such as this file CaseStudy.pdf is properly attached.
Of course, sooner or later, you will get a benign delivery-failure notification
about the fictitious recipient undisclosed-recipients@example.com. That doesn’t
in any way suggest delivery-failure to the Bcc recipients. If any of those fail
you’ll get a separate notification per address. In case you are wondering, Bcc
stands for Blind Carbon Copy and is quite different from Cc.
This Document
http://www.scribd.com/doc/16012068/Share-local-language-URLs
http://snow.byethost17.com/unescape/CaseStudy.pdf
http://unescape.hereweb.com/CaseStudy.pdf
Keywords
URL, encode uri, decode uri, URI Component, local language, multilingual, bilingual, local, percent-
encoding, escape, unescape, font problem, share URL, quick reference, Internet address, accent, diacritical,
acute, grave, circumflex, cedilla, umlaut, breve, diaeresis, dieresis, hacek, macron, tilde, wedge,
Wikipedia, email forwarding, any computer, Российский URL, Russian URL, ,ا ان اArabic URL,
中文网址, 中文網址, Chinese URL, Web français, French URL, Nederlands URL, Dutch URL, 日本
語のURL, Japanese URL, Suomi URL, Finnish URL,
, Tamil URL, Javanese URL,
URL, Bengali URL, Português URL, Portugese URL, Deutsch-URL, German URL, Shqipe
URL, Albanian URL, Български URL, Bulgarian URL, Català URL, Catalan URL, Hrvatski URL,
Croatian URL, Česká URL, Czech URL, Dansk URL, Danish URL, Eesti URL, Estonian URL,
Filipino URL, Galego URL, Galician URL, Ελληνική URL, Greek URL, ¯, Hebrew
URL, ि Û ी , Hindi URL, Magyar URL, Hungarian URL, Bahasa Indonesia URL,
Indonesian URL, URL italiano, Italian URL, 한국어 홈페이지, Korean URL, Latviešu URL,
Latvian URL, Lietuvių URL, Lithuanian URL, Malti URL, Maltese URL, Norsk URL, Norwegian
URL, Polski URL, Polish URL, URL-ul românesc, Romanian URL, Српска УРЛ, Serbian URL,
Slovenský URL, Slovak URL, Slovenski URL, Slovenian URL, Español URL, Spanish URL,
Svenska URL, Swedish URL, URL , Thai URL, Türkçe URL, Turkish URL, Ukrainian URL,
Ti ng Vi t URL, Vietnamese URL, Oriya URL, Sinhalese URL, Malay URL, Afghani URL, Pashtu
URL, Pashto URL, Mon-Khmer URL, Armenian URL, Bhutani URL, Burmese URL, Kampuchean
URL, Persian URL, Latin URL, Greek URL, اردوurl, Urdu URL, Punjabi URL, KISARA URL,
Swahili URL, ी, ગ ી, , , , , Punjabi, Русский, Russian,
,اArabic, 中文, Chinese, français, French, Nederlands, Dutch, 日本語, Japanese, Suomi, Finnish,
, Tamil, Javanese, , Bengali, Português, Portugese, Deutsch, German, Shqipe, Albanian,
Български, Bulgarian, Català, Catalan, Hrvatski, Croatian, Čeština, Czech, Dansk, Danish, Eesti,
Estonian, Filipino, Galego, Galician, Ελληνικά, Greek, , Hebrew, ि Û ी, Hindi, Magyar,
Hungarian, Bahasa Indonesia, Indonesian, Italiano, Italian, 한국어, Korean, Latviešu, Latvian,
Lietuvių, Lithuanian, Malti, Maltese, Norsk, Norwegian, Polski, Polish, Român, Romanian, Српски,
Serbian, Slovenský, Slovak, Slovenski, Slovenian, Español, Spanish, Svenska, Swedish, , Thai,
Türkçe, Turkish, Ukrainian, Ti ng Vi t, Vietnamese, ,اردوUrdu, Malay, Kampuchean, Burmese,
Bhutani, Oriya, Sinhalese, Afghani, Pashtu, Pashto, Mon-Khmer, Armenian, KISARA, Swahili,
Telegu
2/2