TeluguPeople
  are the trend-setters

 
Articles: My Thoughts
Common Script for Indian Languages
- Mr. nageswara Rao akula
  Page: 1 of 1    
BHARATHI SCRIPT Common Character Set for Screen Editors for Indian Languages. Dedicated to Lord Sri Venkateswara. Om Venkateswarayanamaha. Key Words: Alphabet :These are a set of spelling elements to compose the Phonum. Font :Unique graphic character (symbol) 1. Script :A set of FONTS for a set of spelling elements (Alphabet). Alphabetic script :A Font will not combine with another Font to form a composite Font to represent a syllabic or conjunct letter. Syllabic script :The Fonts do combine to form a composite Font to represent a syllable or a conjunct letter. Abstract: This article is an honest attempt made to give an alphabetic character set for use of keyboards and screen editors. By simply adding this character set as Font in screen editing in any one of the word processing programs an ISCII-7 file, (like ASCII file) can be made for all the languages. By using this file as an input ISCII-7 file to any Indian script processors the text can be read in that script Introduction: The information technology wave has touched every country. Internet paved the way for communication between Computers and PCs through network. That means communication between several (different) Operating Systems, different character set Codes and processors have different Machine Codes. This has necessitated to have Common character set Code and Machine Code. That is what we observe in JAVA virtual Code concept. Even for input output (interface) operation of Indian Scripts with computers, CDAC has evolved common electrical code (Inscript). Computer Scientists have adopted Universal Net Language (UNL) based on ‘BHAVA’ (The human perception) to achieve communication (i.e. with on-line translation between several linguistic groups. Thus, the computer Scientists has evolved an effective common electrical code for universal communication. ____________________________________________________________________________ 1 The Word “Font” has been used deliberately to mean a graphic character rather than a character set. Personal computers are cost effective and have achieved unimagined power, applicability and pervasiveness. The revolution has just begun and it will take a decade to assess fully its impact on society. It is certain; that India with millions of computers will be different from the India we have now. Computers / word processors are script (code) dependent. For them, a WORD is a string of letters in between two delimiters irrespective of meaningful information as inferred in any language. So the only limitation of computers / word processor based on any script is as how best it could transliterate any language phonetically. There are three major environments for which the data reproduction is required, Namely (a) Input environment (b) Output or display environment (c) Internal representation of storage and processing environment. The input environment essentially required linearisation of input text and keyboard design. Keyboarding efficiency and editing are some of the major considerations while inputting the Text/data. It is quite uncomfortable to the user, if he does not have some bare minimum facilities for EDIT while entering a text (scripts). It is in the context of Indian Scripts is little more involved than with Roman. No single representation is suited for these three environments. Keyboards and VDU/ printers play an important role in man machine interface in computers and communication equipment’s. These are based on a set of graphic Characters and it’s Unique electrical code . Two approaches are widely used in their design, they are : a) Phonetic approach wherein phoneme (spelling elements) play an important role than fonts. In Indian scripts, these phoneme are common to all Indian languages and dialects. b) Graphic approach wherein fonts play an important role than spelling. In Indian languages the actual fonts are 100 in number and are different from script to script. In some instances, the TYPE ( Font ) represents a similar graphic shape for three or four letters We observe that Roman script Fonts are widely utilised in communication and computers in European countries. European scripts have common Roman fonts in which each font represents a different Phonum in different Languages. But they have adopted Roman Fonts for different European languages. What we observe in Europe is common Fonts but not common script. On the contrary, in India we have common phoneme ( Alphabet ) viz.अआइईउऊ, कखगघ, यरलवश….. etc., but not fonts for different languages, which is an example of phonetic approach. Indian scripts have the same spelling elements (consonants and vowel signs) and follow the same spelling diction. Observation of graphic aspects of Indian scripts has 104 types, which consists of Vowels-16, Vowels and Vowels signs-14+3, Alphabetical syllables-35 and Consonants-35. The text / data entry in Indian Languages uses the ISCII-7 code. This code is a phonetic code which is widely used in spelling of Indian languages. The graphic aspect of various Indian scripts still exists while generating the composite characters [conjunct, and syllables] but these are script dependent. Lot of improvements has taken place for effective optimisation of codes and algorithms in many faculties. Unfortunately, no improvements have been made in Indian Scripts and we are still following the same 11th century scripts except for little changes made for adopting them for printing. Now is the time to extend those coding principles for the Indian Scripts (i.e., graphic character sets), thus, evolve a common graphic character set as an alphabet for keyboards and Screen Editors for Indian Languages. Indian languages spelling and Phonetic Coding (a) Syllable – as a combination of consonant (halant pronunciation) plus (+) vowel sign (matra). (b) Conjunct letters as a combination of two or more consonants plus (+) vowel sign. (c) Alphabetic (script) consonants as a combination of halant consonant plus (+) `अ ‘vowel sign. The above examples (definitions) clearly illustrate that the spelling elements are mainly Consonants and Vowel signs. In general we have 14 vowels and 14 vowel signs apart from 3 nasal signs. We observe that the frequency of occurrence of Matra symbols is higher than corresponding vowels. Hence in the case of shortage of space on keyboards MATRA symbols can be retained on key tops instead of Vowels. In such case, a vowel can be generated through operation of vowelisation key (VK) fallowed by respective Matra symbol. The same is followed in Savarnamala technique also, wherein letter अ followed by vowel sign to represent the corresponding vowel. It looks like अ barahkhadi. This adaptation optimises requirements of Phoneme for both vowels and vowel signs ( Matras ) from 14 + 14 =28 to 14 + 1 = 15., The implied अ vowel sign ( Matra ) in stand alone alphabetic consonant and lack of अ matra and has resulted in separate set of Halant consonant fonts apart from alphabetic consonants. If we adopt halant consonant as fundamental shape (character) and have अ vowel sign, the number of characters will be only 35+1=36 instead of 35+35 =70. The additional TWO Consonants of south Indian languages ள ழ and क़ ख़ ग़ ज़ ड़ ढ़ फ़ , the SEVEN heavy Consonants makes the total 35+2+7 = 44 Phoneme. So, in total, we can reduce the phoneme to 44+15=59 plus 3 nasal signs ( ँ , ं, ः ). Thus the total will be 59 + 3 = 62 Phoneme. These 62 phoneme are well suited for adaptation to 7-bit ASCII code with all the control codes (00H to 1FH) unchanged. . This code is a phonetic code which is widely used in spelling of Indian languages. Hence our people can easily follow it. This optimisation resulted as Indian Standard Code for Information Interchange ( ISCII 7-bit ) CODE . This code is used for Text entry ( Internal Representation and Storage ) of Indian Languages, Processing environment and Communications. Huge investments have been already made on all existing Input / Output Devices ( machines ) are based on ASCII 7-bit. Indian Languages can exploit this advantage with this adoption. The scripts are of two types viz. a) Alphabetic b) Syllabic In Indian languages, the syllabic scripts are chosen wherein each composite font represent EAR level Phonum ( since ear detect syllable as single ( integral ) sound but not as a combination of two sounds ) . By adopting a syllabic script the readability improves. If unique graphic characters are not chosen for Syllabic Script, it becomes difficult to identify the Composite characters. Alphabetic Script has an advantage of linearization and ease of keyboarding and Editing. Script is Unique Graphic Code (a set of Graphic Characters ) for a set of Spelling Phoneme (sound elements ) . The script that follows unique correlation between graphic characters (Fonts) and spelling phoneme causes familiarisation and learning easier. Such PHONETIC Script has Enormous Transliterating capabilities. Recognition and Learning:- We observe that a child of few months old can distinguish between familiar ( i.e. mother and persons attending the child ) and unfamiliar persons. Similar is the case with their voices also. From this, it is quite evident that the child is able to recognise the familiar /similar things as one group and unfamiliar / dissimilar ones as different groups. This is possible because the child uses the built-in (Natural) visual and voice recognition algorithms /templates. This phenomenon helps the child to identify and understand the nature, since it follows truly the natural algorithm. For example, when we want to represent an animal or human being by line drawing or by painting, the third person recognises it as long as the drawing follows the natural algorithm of pattern recognition. If the painter uses his own algorithm, the third person cannot recognise it unless he has the prior knowledge of the algorithm. The most important factor to develop a common script for transliterating Indian Languages is the selection of widely known Fonts, for easy recognition. While doing so, it should be kept in mind that the number of Fonts, with unique graphic feature for better identification, and the number of spelling phoneme of the set of Languages should be uniquely correlated. Thus, the optimised phonetic script requires a set of 55 distinct graphic characters. Familiarising a set of 55 unique graphic characters is very difficult. The next section examines the possibilities of further grouping of Phoneme to arrive at a relatively less number of unique graphic characters. An optimal grouping of vowel signs: We observe in Vowels that both normal (short) and long forms are there. These are grouped as a group. अ आ इ ई उ ऊ ऋ ॠ ऎ ए ऐ ऒ ओ औ Nasal Signs- - - -Three ँ ं ः An optimal grouping of Consonants: We observe in Consonants have both Normal (soft) and Hard forms. Each pair is grouped as a phonetic group. क ख ग घ ङञ च छ ज झ ट ठ ड ढ णन त थ द ध प फ ब भ म य र ऱ ल ळ व श ष स ह Four Special characters ( Southern Labguages).. ఱ ళ ழ ள Total 31 characters having distinct (unique shape ) Doublets ------- 20 = 40 18 = 36 Triplets ------- 1 = 3 3 = 9 Singles ------- 10 = 10 8 = 8 31 = 53 29 = 53 The option before us is to have either 53 distinct shape characters or 31 distinct shapes (with 20 doublets, 3 triplets and 8 single distinct shapes Graphic Grouping : We observe that Roman and Devanagari fonts have been most widely known among Indians. These script characters are examined and categorized further them into broad group of shapes. Vowels and consonants . ढ ट द ड इ झ भ म ग घ ध छ क व ब फ प ष श र स क फ व ब उ अ ञ ऋ य थ ए त ह च ज Types No of Groups No of Characters Triplets 7 21 Doublets 5 10 Singles 5 5 17 36 . In Devanagari script, there are only 17 unique Fonts ( refer to above analysis ). In case we fall short of doublets some of the triplets can be used as doublets. Roman script: The Roman script characters can be broadly categorized into 15 unique fonts or shapes. O Q G N M H P R B C G Q K X F E D B T I V U A Z J Y L W The detailed study of Indian scripts reveal requirements of 38 unique Fonts for consonants (5 single fonts, 15 doublets and 1 triplet ), 14 vowel signs (2 single fonts and 6 doublets ) and 3 nasal signs. An attempt has been made to select a set of Fonts chosen from both Devanagari and Roman Fonts to formulate a common script ( designated as “Bharathi”) For clear demarcation of Vowel signs and Nasal signs a set of Roman Characters are adapted, similarly another set of Devanagari characters are adapted for Consonants and Vowelisation character. This adaptation makes the reading of this script easier. With the introduction of Vowelisation Character उ and अ Matra Character the optimised script ( Bharathi ) has Matras ( vowel signs ) and Halant consonants as unique Graphic characters. Text is generated by juxtaposition of Graphic characters as per collation order. Vowelisation character (1) उ Vowel signs (14) Vowel signs are categorised as Normal (short) and Long forms in each phonetic group. Devanagari Vowels Bharathi Vowel signs Normal Long अ आ T इ ई F E उ ऊ U V ऋ ॠ K X ऎ ए e A ऐ Y ऒ ओ O Q औ G Nasal signs are 3 DevanagariNasal signs Bharathi Nasal signs ँ N ं M ः H Consonants 35 Consonants are categorised as Soft and Hard forms in each phonetic group DevnagariAlphabeticConsonants BharathiConsonants DevnagariAlphabeticConsonants BharathiConsonants Soft Hard Soft Hard Soft Hard Soft Hard क ख k K ग घ g G च छ C C ज झ j J ट ठ t T ड ढ d D त थ f F द ध w W प फ p P ब भ b B ङ ञ ` X न ण n N म m य y र ऱ r R ल ळ l L व v स श s S ष x ह h The Da and Pa Barah Khadi are illustrated below ड डा िड डी डु डू डृ डॄ डॆ डे डै डॊ डो डौं डं डः d dA di dI du dU dq dQ de dE dY do dO dV dM dH प पा िप पी पु पू पृ पॄ पॆ पे पै पॊ पो पौ पं पः p pA pi pI pu pU pq pQ pe pE pY po pO pV pM pH The first sloka of Venkateswara Suprabhatham “kausalya supraja rama ….” and has been transliterated in the Bharathi script and illustrated below . kVslzyA supzrjrAmA pUrzvA sMwzyA pzrvrzffe aufzfixzT nrSArzWUl krzfvzym wYv mAnzhikmz The first sloka of Venkateswara Prapathih “Eesanam jagatosya venkatapate …” has been transliterated in the Bharathi script and illustrated below aISAnAM jgfOszy vEMktpfE rzvixNOM prAM pzrEySIM fwzvkzxszfl nifzyvAsrSikAM ffz kzxAMfi sMvrzWinIM pwzmAlMkqf pANiplzlv yUgAM pwzmAsnsFAm SzrEyM vAfzslzyAwi guNojzvlAM BgvfIm vMwE jgnzmAfrmz Janaganamana song has been transliterated in the Bharathi script and illustrated below. jngNmn aWinAyk jyhE BArf BAgzy viWAf pMjAb siMWu gujrAf mrAT wzrAvid aufzkL vMg viMWzy himAcl ymunA gMgA auczcl jlWi frMg fvSuB nAmE gAyE fvSuB aASixmAgE fvjy gAF jngN mMgL wAyk jyhE BArf BagzyviWAf jyhE jyhE jy jy jy jyhE The view of having one script by Mahatma Gandhi is espoused in the following. “The question of having ONE Script for all the Indian Languages …….has been before the public for number of years. The burden of knowing number of scripts is unnecessary and easily avoidable. I would like to appeal to the Men of Letters of all provinces to resolve their differences on this point and be agreed on this matter of prime importance .Yet in this days of aggressive provincialism, perhaps any Plea for one script will be regarded as an impertinence. But the Literary Campaigns raging all over the country should compel a hearing for Advocates of One Script. I have been one such for Years. Interprovencial intercourse will be much felicitated by such adoption and learning of various provincial languages will be made infinitely easier than it is today. If the educated people of the land were to put their heads together and decide upon one script , its universal adoption should be an easy thing. If they will adopt a common script they will save for future generations tons of labour and time and earn their blessigs”. Harizan …5 Aug 1939. Scientists does research and develop devices, the Intelligentsia use them and assess their effectiveness. If the device is simple, easy to understand and effective, they become popular. This can be seen from the famous quotation “ The Facts are Facts, Facts are Seen, Facts can be Realized, if realized facts are reproduced it is a perfect Science. Theory is known as theory of Realization. Thus the Scientists and Technologists have emerged as universal community”. Let the computer scientists analyse script and adopt this phonetic Alphabet (script) for Key boards and Screen Editors (for Inputting and Editing of the Text / Data) of the Indian Languages, they will realise the potentialities and ease of learning this Alphabetic Script. A day may come that this script be adopted as common script for transliteration of Indian Languages. With the advent of word processors with DTP capabilities, important books and documents can be made (copied) on CD‘s using ISCII –7 code. The editing of them becomes easy using these Fonts for Screen Editors and Keyboards. The printing them using Inkjet printers / laser printer can be done either with existing regional script or the new script. Conclusions: The finalization of the phonum assignment to the Fonts can be done only after the frequency of occurrence of spelling level phoneme in various Indian languages. The most occurring phonum can be assigned to a font which is most easier in writing. The next frequently occurring phonum is assigned to the next easier Font and so on. Thus the most effective Indian alphabet can be evolved after assessing. The statistical distribution of graphic characters occurring in all the Indian languages can be assessed using the proposed new phonetic script with the help of computers. The intelligentsia should keep in mind that the alphabet is introduced to children of five to eight years age group, hence it should simple and effective. The present Indian script’s barahkhadi, other composite characters are generated by software package in the background shows the complicatedness of them. Any reform in modifying the Indian scripts without changing the spelling elements of our languages. Acknowledgment: With all humility, gratitude and devotion I acknowledge my indebtedness to Lord Sree Venkteswara, whose supreme power inspired me with the thought to explain the phonetic script, “BHARATHI “ and its adoptability to modern devices. References: a) Nageswara Rao (A). PANCHAJANYA (evolution of new phonetic script “BHARATHI “ and devices ) 1981, b) Nageswara Rao (A). Indo-Roman script for transliteration of Indian Languages, JIETE Nov 1984. (Special issue of Computerisation of Indian Languages.) For further particulars contact: A.Nageswara Rao ., Jt. Director( Retd.) , DLRL E-mail: akula_nagesh@rediffmail.com 137. Midhani Enckave, Kanchanbagh Hyderabad. 500 058. **** From Editors' Desk - Some of the characters posted by the User may not be appearing appropriately. Also, since we find this article to be unique in nature, we have also not attempted much to change the format.

Read 1 Comment(s) posted so far on this Article!

  Page: 1 of 1    



 
Advertisements
Advertisements
Advertisements
Beauty and Skin Care
For all your favorite branded products of Beauty, Skin Care, Perfumes, Makeup and more!
News
Headline News
Cinema News
Business
Special Stories
Devotion
NRI News
Social Media
Facebook
Movie Gallery
Devotional Gallery
Twitter
Photo Galleries
News Gallery
Cinema Gallery
Beauty Gallery
Fashion Gallery
Sports Gallery
Travel Gallery
Devotion
Classifieds
Jobs
Real Estate
Automobile
Personals

Search TeluguPeople.com

(C) 2000-2025 TeluguPeople.com, All Rights Reserved.