1=Language considerations (Indonesian)Please see Module:checkparams for help with this warning.
This is a Wiktionary policy, guideline or common practices page. Specifically it is a policy think tank, working to develop a formal policy. | |
Policies – Entries: CFI - EL - NORM - NPOV - QUOTE - REDIR - DELETE. Languages: LT - AXX. Others: BLOCK - BOTS - VOTES. |
This policy explains considerations for Indonesian entries that are not covered by WT:ELE and other general policies.
This is a simple entry for the word kamus (“dictionary”), and shows the most fundamental elements of an article:
==Indonesian== ===Etymology=== From {{inh|id|ms|kamus}}, from {{der|id|ar|قاموس|t=dictionary, lexicon}}. ===Pronunciation=== * {{IPA|id|/ka.mus/}} * {{hyphenation|id|ka|mus}} ===Noun=== {{head|id|noun}} # ] ===Further reading=== * {{R:KBBI Daring}} ----
A lemma is the canonical form of an inflected word; i.e., the form usually found as the headword in a dictionary, such as the nominative singular of a noun, the bare infinitive of a verb, etc. In Indonesian dictionaries, such as KBBI, kata dasar (“lexical stem”) treated as lemma, and most derived words (except prefixed di-, -kah, -ku, -lah, -mu, -nya) treated as sublema (sublemma). For example, beli (“buy”) is a lemma, and membelikan is a sublema, but membelikannya is not a lemma. However, for practical consideration, lema and sublema are treated as lemma in this Wiktionary.
As Indonesian is written only in the Latin script, the lemma entry in Indonesian is the kata dasar (“lexical stem”) in Latin script using the latest spelling (2015 Ejaan Bahasa Indonesia).
The layout for lemma entries in Indonesian is based on Wiktionary:Entry layout. For non-lemma entries, see non-lemma entries for specific considerations.
As for editing, {{also}}
shall be added to current edit if necessary.
Alternative forms are variations of the same word kept in multiple pages. Examples include:
The standard form (bentuk baku) listed in {{R:KBBI Daring}}
is usually considered the standard form.
Warning: Do not list the word which standard by KBBI but never used by speaker as standard form, use commom form.
Entries that are alternative forms will have templates such as these at the definition line to link the entry back to the standard form.
{{alternative spelling of|id|WORD}}
{{obsolete spelling of|id|WORD}}
{{alternative case form of|id|WORD}}
{{nonstandard form of|id|WORD}}
The "Alternative forms" header can also be listed after the part of speech, if the spelling is specific to that part of speech.
The first header below the language heading is usually the level 3 “Etymology” header. The etymology is given right below the header without indentation. Etymology essentially shows where the word comes from. This may show the forms in other languages that underlie the word. For many modern words it may show who coined the word. If a word is derived from another in the same language by a regular rule, such as formation of an English adverb by adding “ly”, it is not necessary to repeat the complete details of the word’s origin on the page for the derived word.
Sometimes two words with different etymologies belong in the same entry because they are spelled the same (they are homographs). In such a case there will be more than one “Etymology” header, which we number. Note that in the case of multiple etymologies, all subordinate headers need to have their levels increased by 1 in order to comply with the fundamental concept of showing dependence through nesting.
A significant category of words in a language are the so-called ‘native’ or ‘inherited’ words; in some languages, but not all, they form the majority of words. This means that they have developed from an earlier form of the language which may or may not have gone by the same name. The ancestor languages of Indonesian are in this order:
ms
)ms-cla
)ms-old
)poz-mly-pro
)poz-mcm-pro
)poz-msa-pro
)poz-sus-pro
)poz-pro
)map-pro
)For words with an unbroken chain of inheritance, {{inh|id|lang|WORD}}
is used. If the source of the inherited word is borrowed from a nonancestral language, e.g. a Malay word borrowed from Sanskrit/Portuguese/Arabic or a Proto-Malayo-Polynesian word borrowed from Proto-Oceanic, then {{der}}
is used instead of {{inh}}
. See Template:inherited for a more detailed explanation.
For informative purpose, add words from Classical Malay, only show if those words attested.
Some words have been borrowed from other languages, either because of a historical occupation or co-existence, or simply through exposure to other languages. Borrowings can be ancient or recent. When words are first borrowed into a language they may still ‘seem’ foreign. After a while they become more naturalized. Eventually they seem completely native. To flag a borrowing, use {{borrowed}}
or the shortcut {{bor}}
. For extra consideration for borrowings/loanword, see Borrowings/Loanword below.
{{calque}}
should be used. For example, the term of sekolah rakyat is a calque of 国民学校 (kokumin gakkō). Thus, it is written as {{calque|id|ja|国民学校|tr=kokumin gakkō}}
.{{learned borrowing}}
should be used. For example, the term of fakultas is a learned borrowing of facultas. Thus, it is written as {{learned borrowing|id|la|facultas}}
.{{phono-semantic matching}}
should be used. For example, the term of moda raya terpadu is a phono-semantic matching of mass rapid transit. Thus, it is written as {{phono-semantic matching|id|en|mass rapid transit}}
.{{semantic loan}}
should be used. For example, the term of garam (one of the compounds formed from the reaction of an acid with a base) is a semantic loan of salt. Thus, it is written as {{semantic loan|id|en|salt}}
.{{back-form}}
helps here. For example, citrum is a back-formation of citra, thus it uses {{back-form|id|citra}}
in the citrum page.{{blend|id|layanan|tanpa|turun}}
can be used for lantatur.{{calque}}
should be used. For example, the term of sekolah rakyat is a calque of 国民学校 (kokumin gakkō). Thus, it is written as {{calque|id|ja|国民学校|tr=kokumin gakkō}}
.{{clipping|id|-}}
{{reduplication|id|mata}}
.The "Pronunciation" section includes the transcriptions, audio pronunciations, rhymes, hyphenations and homophones.
{{IPA|id}}
followed by the transcription in the International Phonetic Alphabet is used. Phonemic transcriptions are placed between diagonal strokes (/ /) while phonetic transcriptions are placed between square brackets ().{{accent|region}}
(e.g. {{a|Papua}}
) is used before {{IPA|id}}
.{{hyphenation|id|ka|ta}}
.{{homophone|id|term 1|term 2}}
. For example, bank and bang (“older brother”). If a word is a homophone in a particular dialect, then |q1=Jakarta
can be used to specify the dialect for that particular homophone.{{rfp|id}}
can be used to request a pronunciation in a Wiktionary entry.Template {{id-pr}}
can used to substituted to many templates.
The part of speech (POS) is a descriptor like “Noun” or “Adjective”. The allowed POS are:
The headword line is the line directly below the part of speech header, in which the word is repeated, along with a alternative spelling with diacritics (see below) if applicable. For example, headword line of lensa (“lens”) is {{head|id|noun|head=lènsa}}
.
Although diacritics are often omitted in daily usage, diacritic is used in dictionary (mainly KBBI) to differentiate distinctive phoneme of "e", specifically /ə/ and /e/. The phoneme /e/ can be realized as and . While the used diacritic are varied on preference of the author, the diacritic based on KBBI.
If the letter "e" is present in the headword, then an additional parameter |head=
is required with the normal "e" replaced by one of the following:
e.g. {{head|id|verb|head=sêlèwèng}}
for seleweng (“to deviate”).
Remember not to do add diacritics to hyphenation section.
The definitions are in the POS section, below the headword line. The definitions are organized as a numbered list. The numbers are generated by adding the number sign (#) at the start of each definition in the wikitext. The key terms of a definition should be linked to the respective entries.
A context label is placed before a definition to indicate any one of the following:
Many context label templates also categorize an entry into a relevant category, but they must not be used merely for categorization. Examples of recognized labels can be found in Appendix:Glossary. Context labels commonly used in Indonesian entries include the following:
{{lb|id|acronym}}
{{lb|id|archaic}}
{{lb|id|classifier}}
{{lb|id|colloquial}}
{{lb|id|dated}}
{{lb|id|dialectal}}
{{lb|id|diminutive}}
{{lb|id|euphemism}}
{{lb|id|figurative}}
{{lb|id|formal}}
{{lb|id|historical}}
{{lb|id|neologism}}
{{lb|id|informal}}
{{lb|id|nonstandard}}
{{lb|id|obsolete}}
{{lb|id|outdated}}
{{lb|id|proscribed}}
{{lb|id|short form}}
{{lb|id|slang}}
{{lb|id|vulgar}}
Several "form of" template are permitted in definition section. There are several shortening "form of" templates, such as:
{{abbreviation of|id|bea cukai}}
in becuk.{{acronym of|id|ilmu pengetahuan alam}}
in IPA.{{clipping|id|-}}
{{aphetic form of|id|otomobil}}
in mobil.{{apocopic form of|id|pengen}}
in pen.{{syncopic form of|id|kayak}}
in kek.{{contraction of|id|selalu}}
in s'lalu.{{ellipsis of|id|pispot}}
in pot or {{ellipsis of|id|dokter spesialis}}
in spesialis.{{initialism of|id|jaksa penuntut umum}}
in JPU.{{short for|id|kata sandi}}
in sandi.Indonesian is a agglutinative language, thus words may contain different morphemes to determine their meanings.
{{superlative of}}
. For example, {{superlative of|id|baik|t=best}}
for terbaik.{{alternative spelling of}}
or {{alt sp}}
{{dated spelling of}}
{{censored spelling of}}
{{deliberate misspelling of}}
{{informal spelling of}}
{{misspelling of}}
{{nonstandard spelling of}}
{{pronunciation spelling of}}
{{superseded spelling of}}
{{uncommon spelling of}}
{{passive of|id|merawat|t=}}
{{form of|id|base-imperative-colloquial|merawat|t=}}
Quotations can be used to provide evidence that a word or sense exists (as required by WT:ATTEST), and to provide examples of how the word is used. For words such as alternative spellings or nonstandard compound words that are attested in Indonesian but not found in major dictionaries such as {{R:KBBI Daring}}
, the addition of quotations is highly recommended.
# Definition of word in English #* {{quote-journal|id|author=''Name of author''|title=''Title or headline''|trans-title=''English translation of title''|journal=''Name of publication, journal or newspaper''|url=''URL''|archiveurl=''Internet Archive URL''|archivedate=''Date of Internet Archive link''|date=''Date of publication of article''|text=''Sentence containing the '''word''' made bold|translation=English translation of sentence with translated '''English word''' in bold}}
Generally, every definition should be accompanied by a quotation illustrating the definition. If no quotation can be found, it is strongly encouraged to create an example sentence.
e.g. kementerian, the Indonesian word for ministry:
# Definition of word in English #: {{ux|id|Example sentence (not in italics), with '''kementerian''' made bold. |translation=An English translation of the sentence with '''ministry''' in bold.}}
Synonym, antonym, hypernym, hyponym, meronym, holonym, and troponym are examples of semantic relations. Detailed explanation and usage are covered in Wiktionary:Semantic relations.
If there are multiple parts of speech and it is not known from which part of speech a certain derivative was formed, then semantic relations will be on the same level as the part of speech headings.
List terms in the same language that are morphological derivatives. In Indonesian, affixed word (kata berimbuhan) and compound word (kata majemuk) are part of derived terms.
Usage of {{id-der}}
is recommended for affixed derivatives.
List words in the same language that have strong etymological connections but are not derived terms. Each term should be wikified with {{l|id|WORD}}
. Note that words with similar meanings but are not etymologically related are listed under the "See also" header.
The template {{desc|lang|WORD}}
is used to list terms in other languages that have borrowed or inherited words from Indonesian. The etymology of these terms should then link back to the Indonesian lemma.
See Wiktionary:List of languages for a complete list of language codes.
The "See also" section is used to link to entries and/or other pages on Wiktionary such as appendices and categories. Don't use this section to link to external sites such as Wikipedia or other encyclopedias and dictionaries.
The "References" section contains reference works where users can verify the information available on our entries. This improves the reliability and usefulness of Wiktionary. References are especially encouraged for unusual or disputable claims in etymologies or usage notes. Detailed explanation and usage are covered in Wiktionary:References. Templates such as {{cite-book}}
and {{cite-journal}}
are usually used for references.
The "Further reading" section contains simple recommendations of further places to look.
{{R:KBBI Daring}}
and encyclopedias such as Wikipedia or 1911 Encyclopædia Britannica which may be available online or in print.The "Anagrams" section is for anagrams such as asam (“sourness”), masa (“time”) and sama (“same”). Anagrams are listed after the "Further reading" section.
e.g. entry for masa:
===Anagrams=== {{anagrams|id|asam|sama}}
Topic categories such as Category:id:Animals or custom categories such as Category:Indonesian basic words can be added by appending the following at the bottom of entries:
{{topics|id|Animals}} {{categorize|id|Indonesian basic words}}
The complete list of topic categories available can be found at Category:List of topics (note that the first letter is capitalized). List of custom categories available can be found at Category:Indonesian language.
An affixed word (kata berimbuhan) is a word which has been affixed with prefix, infix, or suffix, while a compound word (kata majemuk) is a lexeme (less precisely, a word or sign) that consists of more than one stem, but used in single unit. In most aspects, the entry layout is similar to Lemma entries. However, there are several points.
{{compound}}
. For example, sekolah rakyat uses {{compound|id|sekolah|rakyat|t1=school|t2=people}}
.{{prefix|id|ber-|cerita}}
, adakan uses {{suffix|id|ada|-kan}}
, and gerigi uses {{infix|id|gigi|-er-}}
. It is important to use standard affixes (not use the assimilated form) as Tata Bahasa Baku Bahasa Indonesia, i.e. melaut uses {{prefix|id|meng-|laut}}
, instead of {{prefix|id|me-|laut}}
.Part of speech is similar in Lemma entries, however the headword line uses the (POS) form in order to be categorized as a non-lemma. For example, the entry for bercerita is written as below:
===Verb=== {{head|id|verb form|head=bêrcêrita}} # {{lb|id|intransitive}} to ] a ]
Possessive forms for nouns such as rumahku (my house), rumahmu (your house), rumahnya (his/her/their house) are automatically displayed using {{id-noun}}
which is equivalent to the following:
{{head|id|noun form|plural|{{l|id|buah-buah arbei}}
|first-person possessive|buah arbei{{l|id|-ku|ku}}
|second-person possessive|buah arbei{{l|id|-mu|mu}}
|third-person possessive|buah arbei{{l|id|-nya|nya}}
}}
Note that separate entries are not created for these possessive forms.
During the course of time, the Indonesian language has undergone several spelling reforms. This resulted in different historical spellings of the same word. To reduce duplication of content, historical form entries would serve as soft redirects to standard form entries, and have no definitions or other information except for quotations. All other information belong to standard form entries.
e.g. entry for Jogjakarta:
==Indonesian== ===Proper noun=== {{head|id|proper noun}} # {{superseded spelling of|id|Yogyakarta}}
The categories for historical variation are: archaic, dated, obsolete. See Classifications of old words for specific considerations.
Several words have nonstandard forms which can be found in everyday setting. To reduce duplication of content, nonstandard form entries would serve as soft redirects to standard form entries, and have no definitions or other information except for quotations. All other information belong to standard form entries.
==Indonesian== ===Noun=== {{head|id|noun}} # {{nonstandard spelling of|id|kacamata}}
The standard form entries shall list these nonstandard forms under the Alternative Forms header:
===Alternative forms=== * {{alter|id|kaca mata}}
If there is an entry with both standard form and nonstandard form that have different definitions, then that entry shall be separated into several etymologies. The etymology with standard form shall be entered as lemma, while the one with nonstandard form shall be entered as nonstandard forms. Overall entry layout is similar to the lemma entry.
In KBBI many foreign languages common phrases listed as ungkapan. These phrases or idioms are not Indonesian vocabularies. Remember not to do add these phrases which identical meanings to the original languages. Only add if these phrases have different meanings in Indonesian usages.
Note that {{R:KBBI Daring}}
introduce lemmas from other languages to popularization purpose, many of these terms not yet attested, such as:
Words with these labels in KBBI deserve entries in their respective languages. Editors are encouraged to create entries in the original languages first before creating the Indonesian entry as some of the words may not be attestable in spoken or written forms of Indonesian.
Indonesian and Standard Malay (the national language of Malaysia, Singapore, Brunei) have a common origin that can be traced back to the Classical Malay of the Johor-Riau Sultanate. Historically, Malay was the lingua franca used by maritime traders in the Indonesian archipelago. During the Dutch colonial era, Malay in Latin script was introduced as a school subject around late 19th century.
The earliest mention of "bahasa Indonesia" (Indonesian language) was in the 1928 Sumpah Pemuda (Youth Pledge). After independence in 1945, the language underwent further expansion in vocabulary with addition of new technical, scientific and professional terms by the Lembaga Bahasa dan Budaya (Institute for Language and Culture).
As of 2019, Indonesian and Malay are treated as separate languages on Wiktionary. Examples of differences between the two languages can be found at Wikipedia:Comparison of Standard Malay and Indonesian.
Lemmas that existed during the period of Dutch colonization are recorded in modern dictionaries such as Kamus Besar Bahasa Indonesia (labeled as Bld) and Kamus Dewan (labeled as IB).
Generally, words attested before 1928 are to be placed under the Malay language. (See Sumpah Pemuda)
For words attested between 1928 until independence in 1945 the following considerations are made:
{{lb|ms|historical|Dutch East Indies}}
before the definition to categorize the entry into Category:Dutch East Indies Malay. This format is also applicable for words attested before 1900.{{obsolete spelling of|id|Modern spelling}}
as its definition.Note that spelling from 1901 to 1947 is based on Van Ophuijsen’s Kitab Logat Melayu: Woordenlijst voor de spelling der Malaisch taal met Latijnsch karakter (Malay Vocabulary: Wordlist for the spelling of the Malay language with Latin characters).
In principle, the latest spelling standard is used to determine which variant spelling to place a word’s definition at. Indonesian has experienced several spelling reforms. The spelling reforms are summarized as below:
International Phonetic Alphabet | (1972) | (1967) | (1947) | (1901) |
---|---|---|---|---|
Ejaan yang Disempurnakan | Ejaan Baru / Lembaga Bahasa dan Kesusastraan | Ejaan Republik / Soewandi | Ejaan van Ophuijsen | |
t͡ʃ | c | tj | ||
d͡ʒ | j | dj | ||
x | kh | ch | ||
ɲ | ny | nj | ||
ʃ | sy | sj | ||
j | y | j | ||
u, ʊ | u | oe | ||
ʔ | k | ' |
If the word is a modern borrowing from a contemporaneous language, {{bor|id|lang|WORD}}
is used. For indirect borrowings or earlier borrowings, {{der|id|lang|WORD}}
is used instead.
For modern borrowings from extinct languages such as Sanskrit or Latin, {{learned borrowing}}
or its shortcut {{lbor}}
is used.
For long chains borrowing considered using {{dercat|id}}
. For example:
===Etymology=== {{dercat|id|dum|odt|gem-pro|la}} From {{bor|id|nl|keizer}}.
See Wiktionary:List of languages for a complete list of language codes.
In Indonesian, most loanwords from Chinese language are not based on Mandarin Chinese (官話/官话 (guānhuà)/普通話/普通话 (pǔtōnghuà)/國語/国语 (guóyǔ)/華語/华语 (Huáyǔ), cmn
), but are based on one of the following:
nan
), which can be further divided into:
hak
)yue
).Thus, it is important to input the specific language source in Traditional Chinese characters along with the pronunciation and related definition. Otherwise, the Mandarin romanization is displayed by default. For example, bakpao (“baozi”), which is based on Min Nan 肉包 (bah-pao), is written as below:
===Etymology=== From {{bor|id|nan|-}} {{zh-l|肉包|gloss=]|tr=bah-pao}}
As part of the Indosphere, Indonesian has significant influence from Sanskrit (Bahasa Sanskreta). This influence can be borrowed and inherited through Malay, or borrowed through Javanese or other languages, such as Balinese. These multiple sources can result in multiple borrowings, such as berida (“old”) and wreda (“old”). Thus, for borrowings for Sanskrit, the etymology and original word in Sanskrit shall be entered appropriately depending on how it is borrowed.
For modern borrowings, i.e. learned borrowings, the following is used:
From {{lbor|id|sa|Word in Sanskrit}}
.
For terms inherited through 19th century Malay or Classical Malay, the following is used:
From{{inh|id|ms|Word in Malay}}
, from{{der|id|sa|Word in Sanskrit}}
.
For terms borrowed through local languages such as Javanese and Balinese, the following is used.
From{{bor|id|lang|Word in Javanese/Balinese}}
, from{{der|id|sa|Word in Sanskrit}}
.
Do not use {{bor|id|sa}}
as Sanskrit and Indonesian are not contemporaneous languages.
In Indonesian, the majority of European loanwords are from Dutch or English. Some words may also be calques, e.g. pencakar langit (“skyscraper”).
For modern borrowings, {{bor|id|lang|WORD}}
is used while {{calque|id|lang|WORD}}
is used for calques. The ancestor of the borrowed word, e.g. Proto-Germanic (gem-pro
), Proto-Indo-European (ine-pro
) can be additionally stated using {{der|id|lang|WORD}}
to categorize the lemma into categories such as Category:Indonesian terms derived from Proto-Germanic.
e.g. entry for kaisar (“emperor; empress”)
===Etymology=== From {{bor|id|nl|keizer}}, from {{der|id|dum|keiser}}, from {{der|id|odt|keiser}}, from {{der|id|gem-pro|*kaisaraz}}, from {{der|id|la|Caesar}}.
The pronunciation may be entered in both phonemic (using / /) and phonetic form (using ). If both are provided, the two transcriptions are separated using |
.
The following tags are not intended to dictate whether or how to use the tagged entries, but to inform the reader of the modern rarity and possible connotations within modern contexts.
Classical Malay words (Etymology-only code ms-cla), used 14th to 18th century, are spelt differently from current spelling, or completely different in meaning, as to be virtually a foreign language to modern Indonesian speakers. Entries for such terms are treated as foreign words with the L2 language heading ==Malay==, categorized within Category:Classical Malay, or treated as Indonesian if 'used' in Modern Indonesian (see explanation below). Classical Malay terms which absence in modern time labelled as 'kl' in KBBI.
No longer in use; found only in very old texts. Can also apply to a no longer understood definition of a word. Examples: "to put" definition of membubuhkan.
For vocabularies inherited from Classical Malay and never used in modern day treated as obsolete. For words still used or re-used in modern day usually treated poetic or literary.
No longer in general use, but still found in some contemporary texts (e.g. the Bible). Example, memaklumkan.
The term arkais (archaic) used in KBBI have slightly different usage compared to Wiktionary partice. It's can be translated as dated, archaic or obsolete.
Still in use, but generally only by older people, and considered unfashionable, superseded or dated, particularly by younger people. Examples: tapal gigi (in sense "toothpaste")
Please keep in mind that what may be considered "unfashionable" or "dated" in one region may not be in another, so where possible please include regional information. If in doubt, make a Usage Note stating that the term may be considered unfashionable or dated in some areas.
Verb is a word that indicates an action, event, or state. Verbal prefix in Indonesian consist of meng-, per-, and ter-. Verbal suffixes in Indonesian consist of -kan and -i.