Basic Sinhala is a beginning course presented in three modules dealing with the written in the Sinhala alphabet, this module must be undertaken first. H. The Sinhalese alphabet, which is one of the Brahmic scripts, a descendant of the ancient Indian Brahmi script .. Official Unicode Consortium code chart (PDF). Sinhala script (Sinhala: සිංහල අක්ෂර මාලාව) (Siṁhala Akṣara Mālāva) is a writing The core set of letters forms the śuddha siṃhala alphabet (Pure Sinhala , ශුද්ධ සිංහල), which is Official Unicode Consortium code chart (PDF).
|Language:||English, Spanish, Arabic|
|Distribution:||Free* [*Registration Required]|
Sinhala Alphabet - Download as PDF File .pdf), Text File .txt) or view presentation slides online. sin. The simplest letter in the Sinhala sometimes touch each other and they also have variations in alphabet is known to as “ර”, which is also specified as the writing. Details of the Sinhala alphabet and language, which is spoken mainly in Sri Lanka by about 12 million people.
Then the II. But this research The challenging nature of handwritten character recognition work has not addressed the possibilities of touching characters has drawn the attention of researches for a long time all over in the text and segmentation of the touching characters.
The work presented by M. M Karunanayaka, N. D These researches have explored many areas available in Kodikara, and G. P Wimalaratne in  has addressed the like the computational pattern recognition area with issue of touching characters. At the beginning, they have techniques such as artificial neural networks  and segmented the images using vertical projection profile method statistical approaches such as Hidden Markov Models  and at that level the touching characters are considered as a to recognize handwritten words or characters.
The first step single entity. Touching character groups, a overlapping, b touching, c intersecting, and d connecting Figure 2.
The unsteady nature of Then the segmented character entities are further classified handwritten characters make the recognition task difficult. The between two categories: In the process the average character width has been characters is a key factor to the accuracy of the character estimated using the width of the image and number of recognition process.
This research will be focused segmenting characters occurs in that image which is obtained from the Sinhala handwritten documents in to lines, words and vertical projection profile as given in 1.
Twenty three characters from the Sinhala alphabet given in Fig. The twenty four characters do not include the vowel signs as well as less frequently used characters. It is assumed that the characters of two consecutive text lines are not touching or overlapping and there is no slant in the text lines.
After distinguishing touching characters, the touching character group has been identified and segmented. Connected V. To distinguish and how the line segmentation, word segmentation, character between the other three groups, the concept known as Water segmentation and segmentation of touching characters were Reservoir Conceptdiscussed in  has been used.
Preprocessing trained using unsupervised learning. SOFMs facilitate A4 size papers were used for collecting handwriting representing multidimensional data using much lower samples. All the sample documents included 5 — 10 lines. The dimensional space. Also it creates a network that stores documents were scanned to get the images required for information in such a way that it represents the topological processing. The images were converted in to binary format relationships of the training samples.
The points that are close using OTSU  thresholding mechanism. Line segmentatiom analyzing tool of high dimensional data.
It is assumed that characters of two consecutive text lines are not touching or overlapping. With that assumption the There is no related work to be found in Sinhala image is segmented in to lines using the horizontal projection handwritten character segmentation with SOMFs. The work profile. The main idea behind their proposal is that If the characters of two consecutive text lines are not the touching pairs can be divided in to three main regions: The authors believe that these regions have projection profile correspond to the white space between the unique characteristics and by mapping the touching characters lines.
Word Segmentation The basic steps of the method proposed in  are, Each of the segmented lines goes through the word segmentation process. The assumption that the characters of 1 Estimating the core zone of the characters two words are not touching each other was made. With that 2 Extraction of feature points as input to SOFM assumption vertical projection profile was calculated for each 3 Determining segmentation path line segment. An example of the vertical projection profile of two text lines are given in Fig.
Use of horizontal projection profile for line segmentation As the gap between two characters within a word is The method proposed in  using SOFMs will then be relatively smaller than the gap between two words, a threshold used to segment the touching characters. I Core zone estimation The ascender and descender components of the characters D.
Basic Character segmentation will be removed in order to get the core zone of the characters. Each word segmented goes through the character As suggested by Kurniawan F. Again, vertical projection profile was is done in order to improve the clustering process . Touching Character segmentation III Feature extraction and using SOFM for clustering The touching characters should be identified at first before The simple feature extraction method will be used to moving in to the segmentation process.
To identify a touching generate the feature vector.
Number of maximum feature character, a similar approach to  has been used. The program can be installed on Android. Sinhala Alphabet version 1. Just click the green Download button above to start.
Until now the program was downloaded times. We already checked that the download link to be safe, however for your own protection we recommend that you scan the downloaded software with your antivirus. Version History Here you can find the changelog of Sinhala Alphabet since it was posted on our website on The latest version is 1. Sinhala transliteration Sinhala: Layman's transliterations in Sri Lanka normally follow neither of these.
Vowels are transliterated according to English spelling equivalences, which can yield a variety of spellings for a number of phonemes. Sinhalese script was added to the Unicode Standard in September with the release of version 3. Another block, Sinhala Archaic Numbers , was added to Unicode in version 7.
However, all versions of Windows Vista come with Sinhala support by default, and do not require external fonts to be installed to read Sinhalese script. For Linux , the SCIM input method selector allows to use Sinhalese script in applications like terminals or web browsers.
Unicode range. Northern Brahmic.
Southern Brahmic. It is also possible to have no vowel following a consonant. This marker suppresses the inherent vowel. Click on [show] on the right if you see only boxes below. Click on [show] on the right if you only see boxes below. Main article: Sinhala Unicode block.
History of the Evolution of the Sinhala Alphabet. Colombo Apothecaries' Company, Limited. Sunday Observer. Retrieved 21 September Sinhala numerals". Unicode Consortium.
Sinhala Online. Retrieved 23 September Retrieved Sinhalese language. Sri Lanka Sinhalese people. Central Southern Maldivian Rodiya dialect Vedda language.