Skip to content

Orthography#

The [[index|Japanese]] language makes use of three different scripts - two [[Writing Systems#Syllabaries|syllabaries]] and one [[Writing Systems#TODO|logographic]] script. The syllabaries, however, do not represent individual syllables. Instead, each character in them represents a single [[Orthography and Phonology#Moras|mora]].

All three scripts are used simultaneously when writing but they usually serve different purposes. Japanase may be written either horizontally (left-to-right and top-to-bottom) or vertically (top-to-bottom and right-to-left).

Kana#

Kana is the collective name used for the two [[Writing Systems#Syllabaries|syllabaries]] used in the [[index|Japanese]] writing system - hiragana and katakana. Each symbol in these syllabaries representes a single [[Orthography and Phonology#Moras|mora]] and both syllabaries represent the same set of sounds. However, they serve different purposes. Below are tables of both hiragana and katakana symbols, written together with their pronunciation and romanization.

Hiragana
a i u e o
ø
/a/
a

/i/
i

/u/
u

/e/
e

/o/
o
k
/ka/
ka

/ki/
ki

/ku/
ku

/ke/
ke

/ko/
ko
g
/ga/
ga

/gi/
gi

/gu/
gu

/ge/
ge

/go/
go
s
/sa/
sa

/ɕi/
shi

/su/
su

/se/
se

/so/
so
z
/(d)za/
za

/(d)ʑi/
ji

/(d)zu/
zu

/(d)ze/
ze

/(d)zo/
zo
t
/ta/
ta

/tɕi/
chi

/tsu/
tsu

/te/
te

/to/
to
d
/da/
da

/(d)ʑi/
ji

/(d)zu/
zu

/de/
de

/do/
do
h
/ha/, /wa/
ha, wa

/çi/
hi

/ɸu/
fu

/he/, /e/
he, e

/ho/
ho
p
/pa/
pa

/pi/
pi

/pu/
pu

/pe/
pe

/po/
po
b
/ba/
ba

/bi/
bi

/bu/
bu

/be/
be

/bo/
bo
m
/ma/
ma

/mi/
mi

/mu/
mu

/me/
me

/mo/
mo
n
/na/
na

/ni/
ni

/nu/
nu

/ne/
ne

/no/
no

/ɴ/
n
r
/ra/
ra

/ri/
ri

/ru/
ru

/re/
re

/ro/
ro
w
/wa/
wa

/i/
i
(obsolete)
𛄟
/u/
u
(obsolete)

/e/
e
(obsolete)

/o/
o
(obsolete)
y
/ja/
ya
𛀆
/i/
i
(obsolete)

/ju/
yu
𛀁
/e/
e
(obsolete)

/jo/
yo
Katakana
a i u e o
ø
/a/
a

/i/
i

/u/
u

/e/
e

/o/
o
k
/ka/
ka

/ki/
ki

/ku/
ku

/ke/
ke

/ko/
ko
g
/ga/
ga

/gi/
gi

/gu/
gu

/ge/
ge

/go/
go
s
/sa/
sa

/ɕi/
shi

/su/
su

/se/
se

/so/
so
z
/(d)za/
za

/(d)ʑi/
ji

/(d)zu/
zu

/(d)ze/
ze

/(d)zo/
zo
t
/ta/
ta

/tɕi/
chi

/tsu/
tsu

/te/
te

/to/
to
d
/da/
da

/(d)ʑi/
ji

/(d)zu/
zu

/de/
de

/do/
do
h
/ha/, /wa/
ha, wa

/çi/
hi

/ɸu/
fu

/he/, /e/
he, e

/ho/
ho
p
/pa/
pa

/pi/
pi

/pu/
pu

/pe/
pe

/po/
po
b
/ba/
ba

/bi/
bi

/bu/
bu

/be/
be

/bo/
bo
m
/ma/
ma

/mi/
mi

/mu/
mu

/me/
me

/mo/
mo
n
/na/
na

/ɲi/
ni

/nu/
nu

/ne/
ne

/no/
no

/ɴ/
n
r
/ra/
ra

/ri/
ri

/ru/
ru

/re/
re

/ro/
ro
w
/wa/
wa

/i/
i
(obsolete)
𛄢
/u/
u
(obsolete)

/e/
e
(obsolete)

/o/
o
(obsolete, particle)
y
/ja/
ya
𛄠
/i/
i
(obsolete)

/ju/
yu
𛄡
/e/
e
(obsolete)

/jo/
yo

Voiced kana are obtained by adding the diacritic ゛(濁点(だくてん, dakuten)) to the respective voiceless kana.

Additionally, Smaller-sized versions of や, ゆ and よ for hiragana and ヤ, ユ and ヨ for katakana can be used in combination with other hiragana / katakana to form digraphs representing palatalized sounds. Each such digraph still corresponds to exactly one [[Orthography and Phonology#Moras|mora]].

Hiragana Digraphs (拗音 (ようおん))
ya yu yo
k きゃ
[kʲa]
kya
きゅ
[kʲɯ]
kyu
きょ
[kʲo̞]
kyo
g ぎゃ
[gʲa]
gya
ぎゅ
[gʲɯ]
gyu
ぎょ
[gʲo̞]
gyo
s しゃ
[ɕa]
sha
しゅ
[ɕɯ]
shu
しょ
[ɕo̞]
sho
z じゃ
[(d)ʑa]
ja
じゅ
[(d)ʑɯ]
ju
じょ
[(d)ʑo̞]
jo
t ちゃ
[tɕa]
cha
ちゅ
[tɕɯ]
chu
ちょ
[tɕo̞]
cho
d ぢゃ
[(d)ʑa]
ja
ぢゅ
[(d)ʑɯ]
ju
ぢょ
[(d)ʑo̞]
jo
h ひゃ
[ça]
hya
ひゅ
[çɯ]
hyu
ひょ
[ço̞]
hyo
p ぴゃ
[pʲa]
pya
ぴゅ
[pʲɯ]
pyu
ぴょ
[pʲo̞]
pyo
b びゃ
[bʲa]
bya
びゅ
[bʲɯ]
byu
びょ
[bʲo̞]
byo
m みゃ
[mʲa]
mya
みゅ
[mʲɯ]
myu
みょ
[mʲo̞]
myo
n にゃ
[ɲa]
nya
にゅ
[ɲɯ]
nyu
にょ
[ɲo̞]
nyo
r りゃ
[rʲa]
rya
りゅ
[rʲɯ]
ryu
りょ
[rʲo̞]
ryo
Katakana Digraphs (拗音 (ようおん))
ya yu yo
k キャ
[kʲa]
kya
キュ
[kʲɯ]
kyu
キョ
[kʲo̞]
kyo
g ギャ
[gʲa]
gya
ギュ
[gʲɯ]
gyu
ギョ
[gʲo̞]
gyo
s シャ
[ɕa]
sha
シュ
[ɕɯ]
shu
ショ
[ɕo̞]
sho
z ジャ
[(d)ʑa]
ja
ジュ
[(d)ʑɯ]
ju
ジョ
[(d)ʑo̞]
jo
t チャ
[tɕa]
cha
チュ
[tɕɯ]
chu
チョ
[tɕo̞]
cho
d ヂャ
[(d)ʑa]
ja
ヂュ
[(d)ʑɯ]
ju
ヂョ
[(d)ʑo̞]
jo
h ヒャ
[ça]
hya
ヒュ
[çɯ]
hyu
ヒョ
[ço̞]
hyo
p ピャ
[pʲa]
pya
ピュ
[pʲɯ]
pyu
ピョ
[pʲo̞]
pyo
b ビャ
[bʲa]
bya
ビュ
[bʲɯ]
byu
ビョ
[bʲo̞]
byo
m ミャ
[mʲa]
mya
ミュ
[mʲɯ]
myu
ミョ
[mʲo̞]
myo
n ニャ
[ɲa]
nya
ニュ
[ɲɯ]
nyu
ニョ
[ɲo̞]
nyo
r リャ
[rʲa]
rya
リュ
[rʲɯ]
ryu
リョ
[rʲo̞]
ryo

Hiragana is primarily used to represent grammatical elements such as particles, suffixes and inflectional endings. It is also used for certain native Japanese words.

Katakana is primarily used when writing words of foreign origin and onomatopoeias. It can also be used to emphasize a word (similar to using italicizing it). Company names, plant and animal names as well as slang words are usually written in katakana, too.

Long Vowels#

Writing long vowels in [[Orthography and Phonology#Kana|hiragana]] is done in the following ways:

Representing Long Vowels in Hiragana
Spelling Pronunciation Occurrence
ああ /aː/ native words
いい /iː/ native words
うう /uː/ native words and Sino-Japanese words
えい /e/ or /e.i/ native and Sino-Japanese words
おお /oː/ native words
おう /oː/ native and Sino-Japanese words

Writing long vowels in [[Orthography and Phonology#Kana|katakana]] depends on the word's origin. In loanwords, the symbol "ー" is used to represent any long vowel depending on the katakana symbol which precedes it. When native or Sino-Japanese words are written in katakana, the same conventions are used as with hiragana.

Example: Long Vowels in Katakana

テーブル /te̞ːbɯɾɯ/ - table

レコード /ɾe̞ko̞ːdo̞/ - record

アイスクリーム /aisɯkɯɾiːmɯ/ - ice cream

Consonant Gemination#

Consonant [[TODO|gemination]] is achieved by placing a smaller version of the kana for "tsu" (っ for hiragana and ッ in katakana) before a kana which starts with a consonant. This smaller symbol is called 促音(そくおん, sokuon) and is counted as a separate mora.

Example: Consonant Gemination

しっぱい /ɕipːai/ - failure

サッカー /sakːaː/ - soccer

Usually, only unvoiced consonants can be geminated. However, the consonants /n/ and /m/ are geminated when preceded by ん (hiragana) or ン (katakana). In these situations, ん / ン assimilate to /n/ or /m/, respectively.

Kanji#

Kanji ((漢字(かんじ))) are the symbols which comprise the [[Writing Systems#Logographies|logographic]] script used by the Japanese writing system. Historically, these symbols were adapted from the [[TODO|Chinese]] script and, in contrast to [[Orthography and Phonology#Kana|kana]], they mostly represent meaning instead of sound.

Each kanji has a correct way of writing (stroke order) and memorizing it is crucial to being able to read and write handwritten kanji, although it is not necessary if you only plan on reading printed Japanese.

Readings#

Kanji can have multiple pronunciations, called readings, depending on the context. These readings are divided into two categories:
- 音読み(おんよみ, on’yomi, ON readings) - these readings derive from Japanese approximations of pronunciations from earlier stages of various Chinese languages;
- 訓読み(くんよみ, kun’yomi, KUN readings) - these are native Japanese readings.

Most kanji have at least one reading from each category. In dictionaries, kun'yomi are written in hiragana and on'yomi are written in katakana. Contextual cues are needed to decide which reading to use for a given kanji.

Example:

Moreover, kanji are sometimes used in a way similar to kana, i.e. for their pronunciation instead of their meaning. In such cases, the kanji are known as ateji (当て字, 宛字, あてじ) and this practice has largely fallen out favor and has been replaced by katakana. Nevertheless, some common words are still written like this.

Example: Common Ateji

The word "sushi" is often written as "寿司". The character "寿" is read as "su" and the character 司 is read as "shi". However, "寿" means "one's natural lifespan" and "司" means "to administer".

Radicals#

Kanji are composed of smaller graphical elements known as radicals. Some radicals also function as standalone kanji. If you recognize the radicals which compose a given unknown kanji, then you can often guess both what meaning the kanji has, how it should be pronounced and what the correct stroke order for it is. However, radicals often have an altered appearance in various kanji, so it is often not optimal.

Tip: Learning Kanji

Instead of learning each kanji in a word together with a comprehensive list of all of its possible meanings and readings, you should just learn the meaning and pronunciation of the entire word. With time, you will naturally be able to pick up how kanji in new words are pronounced and what they mean.

Furigana#

Kanji may sometimes be written alongside either [[Orthography and Phonology#Kana|hiragana]] or [[Orthography and Phonology#Kana|katakana]] to indicate their reading such as かん. This is usually done in educational contexts or when the reading of a kanji is ambiguous. Kana written in this way is known as furigana.

Phonology#

Vowels#

[[index|Japanese]] has five [[TODO|vowel]] [[TODO|phonemes]].

Japanese Vowel Phonemes
Front Central Back
Close i u
Mid e o
Open a

These [[TODO|phonemes]] are realized by the following [[TODO|allophones]]:
- /a/ by either [ä] or [ɑ̟];
- /i/ by [i];
- /u/ by [ɯ];
- /e/ by [e̞];
- /o/ by [o̞].

The vowels /i/ and /ɯ/ are often devoiced when they are between or after the consonants /k/, /s/, /t/, /p/. The movement of the lips and tongue are essentially the same, but the vocal cords do not vibrate.

Consonants#

Moras#

The [[TODO|phonological]] system of [[index|Japanese]] is based on moras, not [[TODO|syllables]]. One can think of spoken Japanese as being accompanied by a regular pulse with a constant tempo. Each beat corresponds to exactly one mora. Unlike syllables, which can be either short or long, moras always have the same length. This means that a single syllable can be comprised of one or more moras.

Pitch Accent#

English has a stress accent system in which each word has one syllable which is emphasized more than the rest by slightly increasing its length and volume. By contrast, [[index|Japanese]] has a pitch accent ((高低アクセント, kōtei akusento)) system. A [[Orthography and Phonology#Moras|mora]] is accented by pronouncing it with a high pitch followed by a drop in pitch on the next mora in the word. This drop in pitch is known as a downstep.

Each word has either no moras accented (unaccented words) or exactly one mora accented (accented words), which means that once tone falls from high to low, it cannot go back to high within the same word.

There are four fundamental patterns for the pitch accent which words may follow:

Japanese Pitch Accent Patterns
Name Description Countour Example
(へい)(ばん)(がた)
heibangata
This is the pattern which unaccented words follow. The pitch begins low on the first mora, moves slightly higher on the second mora and remains at that level for the rest of the word. / ̄( ̄) Example
(あたま)(だか)(がた)
atamadakagata
The accent falls on the first mora. The pitch begins high on the first mora, moves down on the second mora and remains at that level for the rest of the word. \_(_) Example
(なか)(だか)(がた)
nakadakagata
The accent falls on a mora in the middle of the word. The pitch begins low on the first mora, rises gradually until it peaks at the accented mora, then moves down on the subsequent mora and remains at that level for the rest of the word. /( ̄)\ Example
()(だか)(がた)
odakagata
The accent falls on the last mora of the word. The pitch begins low on the first mora and rises gradually until it peaks at the accented mora. If the word undergoes agglutination, then the pitch drops low on the first particle and remains at that level for the rest of the particles. / ̄(\_) Example

Sources#

  1. Japanese Writing System - Wikipedia
  2. Japanese Phonology - Wikipedia
  3. Hiragana - Wikipedia
  4. Katakana - Wikipedia
  5. Kanji - Wikipedia
  6. Furigana - Wikipedia
  7. Ateji - Wikipedia
  8. Pronunciation I: Vowels - IMABI
  9. Pronunciation II: Consonants - IMABI
  10. Pronunciation III: Pitch Accent & Devoicing - IMABI
  11. Hiragana ひらがな - IMABI
  12. Katakana カタカナ - IMABI
  13. Kana III: Orthography - IMABI
  14. Kanji Intro I - IMABI
  15. Kanji Intro II - IMABI