Trichome

Content deleted Content added
No edit summary
Line 45: Line 45:
|align="center"| no official or traditional status, may use ''' ' ''' or '''‘''' or '''’''' or nothing
|align="center"| no official or traditional status, may use ''' ' ''' or '''‘''' or '''’''' or nothing
|}
|}

== Geographic names in the United States ==

In 1995 the [[United States Board on Geographic Names]] began changing its longtime policy and is now using {{Okina}}Okina and Kahakō in the [[Geographic Names Information System]].<ref>[http://unstats.un.org/unsd/geoinfo/gegn23wp82.pdf U.S Board on Geographic Names: Collection and Dissemination of Indigenous Names] (United Nations Group of Experts on Geographical Names, Twenty-third Session Vienna, 28 March – 4 April 2006, Working Paper No. 82), S. 3: ''"An example of this has been the addition of the glottal stop (okina) and macron (kahako) to placenames of Hawaiian origin, which prior to 1995 had always been omitted. The BGN staff, under the direction and guidance of the Hawaii State Geographic Names Authority, has been restoring systemically these marks to each Hawaiian name listed in GNIS."''</ref>


== Encoding and displaying the Polynesian glottal ==
== Encoding and displaying the Polynesian glottal ==

Revision as of 01:16, 19 September 2009

ʻOkina letter forms
Hawaiian ʻokina The Tongan fakauʻa letter or Hawaiian ʻokina encoded as U+02BB (in Unicode[1]), derived from the Lucida Sans font.
Tahitian ʻeta The Tahitian ʻeta letter (or Wallisian fakamoga), currently not encoded correctly, derived from the Lucida Sans font.

The ʻokina, also called by several other names (see examples below), is a unicameral consonant letter used within the Latin script to mark the phonetic glottal stop, as it is used in many Polynesian languages.

Area Vernacular name Literal meaning Notes
Hawaiian ʻokina separator transitionally formalised
Tongan fakauʻa
(honorific for fakamonga)
throat maker officially formalised
Wallisian (in ʻUvea) fakamoga throat maker no official or traditional status, may use ' or or
Tahitian ʻeta ʻetaʻeta = to harden no official or traditional status, may use ' or or
Cook Islands Maori ʻamata or ʻakairo ʻamata "Hamsah" or "Hamsah mark" no official or traditional status, may use ' or or or nothing

Geographic names in the United States

In 1995 the United States Board on Geographic Names began changing its longtime policy and is now using ʻOkina and Kahakō in the Geographic Names Information System.[2]

Encoding and displaying the Polynesian glottal

Old conventions

In plain ASCII the glottal is sometimes represented by the apostrophe character ('), ASCII value 39 in decimal and 27 in hexadecimal, which in most fonts currently used renders as a straight, data-processing, typewriter apostrophe as is also specified in Unicode. But in some older fonts, especially those used on Unix-like platforms and related platforms and on an MS-DOS screen, it renders as a right single quotation mark (which is the wrong shape).

A "hypercorrect" (that is, incorrect) method for plain ASCII text is to use U+0060 grave accent (`) (incorrectly termed "back-quote character"), which in some older fonts does display a glyph similar to a left single quotation mark. However, in most newer fonts, it has a pronounced lean to the left and can look inappropriate. A (partial) advantage is when a wordlist is alphabetically sorted, the "`" often comes after the "z", exactly where it should be in the Tongan language (admittedly not so in most other Polynesian languages, where it should be ignored). It is still useful as a fallback when words are to be entered into a database with limited character-set ability to have the character distinct from the apostrophe.

The new standard and transitional problems

According to Unicode, the codepoint for ʻokina is Unicode character U+02BB MODIFIER LETTER TURNED COMMA ( ʻ ) which can be rendered in HTML by the entity &#699; (or in hexadecimal form &#x02BB;).[1]

But lack of support for this character in older fonts (and many newer fonts), along with the large amount of legacy data and expense in time and money to convert, has prevented easy and universal use of the new character. As of 2008 Apple Mac OS X based computers have no problem with the glyph, but Microsoft Windows especially still has, but is no longer a problem in Internet Explorer 7 as it was in previous versions. U+02BB should be the value used in encoding new data when the expected use of the data permits.

This character is also a proper one for a Latin-letter transliteration of the Hebrew letter [[Ayin|

ʻáyin]] and the Arabic letter

[[Ayin|

ʻayn]].

They are sometimes also rendered by a superscript half ring with the opening to the right ( 

ʿ ) or even, as a typographical fallback, a superscript cc ).

Unicode encodes a glottal stop at U+02C0 MODIFIER LETTER GLOTTAL STOP (ˀ), but this looks like an undotted question mark, which is inappropriate for ʻokina.

Its orientation and curve should not depend on the font style for apostrophes (so using a left apostrophe is wrong too, because it can be drawn either like a superscript non-curved mirrored comma, or a superscript 6-shaped apostrophe).

True Polynesian texts however draw the okina very differently, and this looks as none of the apostrophe, mirrored apostrophe, turned comma, or accent letter. The Polynesian ʻokina letter is more like 9-shaped left apostrophe, turned about 60 to 90 degrees counter-clockwise.

Tentative approximations

A display work-around

Because this character is not found in many fonts, it may not appear properly on all computer systems and in all configurations. Accordingly, where U+02BB should properly be used, the Unicode punctuation character U+2018 LEFT SINGLE QUOTATION MARK, ‘, represented by the HTML entity &lsquo;, is sometimes used instead. It is nearly identical in appearance to U+02BB, but is treated as a punctuation mark rather than a letter by applications.

In practical terms, this only matters with regard to page breaks, hyphenation, and capitalisation; these usually cause few problems. This symbol is also used instead of the recommended turned comma letter symbol in transliterations from Semitic languages to ensure proper display on the widest number of browsers.

The problem with this left single quotation mark character is that, depending on font style design, the single quotation mark may have two very different shapes, one of which is incompatible with the okina :

  • a superscript straight mirrored comma, drawn from bottom to top and normally thicker on the bottom right than on the top left. The thicker end on the bottom is incompatible.
  • the modifier letter turned comma, but it may still be wrong as it could be drawn in some font designs as an oblique straight line or a wedge without the needed curve, or the curve will be made so that its center will be on the left or top right, when the okina curve should be centered and opened on the bottom or bottom left.

A work-around problem

Nowadays many word-processors are equipped with 'smart quotes', which automatically change the straight apostrophe (') and the straight quotation mark (") into curly ones. If a quotation mark occurs after a space, it is assumed to be an open quote (the left quote), if elsewhere a close quote (the right quote). This policy also allows the apostrophe to be dealt with in the same way. Clearly this is not the behaviour one wants for the glottal. One would end up with text full with 'drunken' glottals, some pointing left, some pointing right. If a special Polynesian keyboard layout is not available, a workaround to the workaround is to insert a ‘dummy’ space before typing the quote (thus making it a left, open quote), then delete the space.

Also standard undo function of the word processors removes the bad autocorrections, for example using the undo icon on the toolbar or pressing CTRL-z in the most widely spread office suites, after the autocorrection happens.

Another problem

In some sans-serif fonts non-bolded and at normal size, the left single quotation character does not appear distinctly different from the straight apostrophe or from the right single quotation character. In Hawaiian, where only one of these curly quotation forms is used as a letter, this matters little. It is more problematic in displaying transliterations from Semitic languages where both left-quotation and right-quotation characters are used with different meanings. Luckily, nearly all scholars of native Hawaiian agree that the apostrophe is an accurate proxy when available and the okina provides no additional value and thus is unnecessary. Now the standard symbol used to separate the final two i's in Hawaii is an apostrophe. This is also the officially approved symbol according to the AP Style Guide.[3]

See also

References

  1. ^ a b Unicode Standard 5.1
  2. ^ U.S Board on Geographic Names: Collection and Dissemination of Indigenous Names (United Nations Group of Experts on Geographical Names, Twenty-third Session Vienna, 28 March – 4 April 2006, Working Paper No. 82), S. 3: "An example of this has been the addition of the glottal stop (okina) and macron (kahako) to placenames of Hawaiian origin, which prior to 1995 had always been omitted. The BGN staff, under the direction and guidance of the Hawaii State Geographic Names Authority, has been restoring systemically these marks to each Hawaiian name listed in GNIS."
  3. ^ APStylebook.com

External links

Leave a Reply