Use the Möllendorf transcription system for Manchu because currently no one could display the Manchu script correctly in a browser.
Things are tougher than you may think. Unicode supports Mongolian, Manchu, Sibe and Todo scripts but it is difficult to display or input these scripts. This page briefly explains why: http://homepage.mac.com/thgewecke/Manchu.html I don't know any browser (or font) that treats positional variants correctly. Also, the implementation of "writing-mode: tb-lr" (CSS3 Text Module) is required.
I made a simple conversion script (from the transcription to Unicode sequence), but I'm not sure if my understanding is correct. Here is a test:
ᠮᠠᠨᠵᡠ ᡤᡳᠰᡠᠨ
(manju gisun) --Nanshu 01:32 Aug 12, 2003 (UTC)
I can see that script in my browser. Petruk 18:01 Aug 12, 2003 (UTC)
Your bot add a lot of pages about chinese characters... I can't see many of them. do you know which font I need ? Koxinga 12:15, 10 Sep 2003 (UTC)
What I currently add are in "CJK Unified Ideographs Extension A" (U+3400 to U+4DBF; 6,582 characters), which were adopted in Unicode 3.0. This area was once occupied by Hangul Syllables, but they were reallocated to another area violating one of Unicode's principles. The Chinese characters, added in turn, were very rare. I confess that I have never seen these words. To show them, you have to get a relatively new Unicode font. If you use Windows, this page will help you. Now I decided to complete "CJK Unified Ideographs" first. --Nanshu 23:41, 10 Sep 2003 (UTC)
Great work that you've been doing on all those Chinese character pages. By way of suggestion, you could revise the "Radical" line to read as follows:
The radical number is especially useful for the non-Chinese for whom the order in which the radicals appear in the dictionary is not always obvious. Adding the word "stroke" at the end simply clarifies that that is what the number is about. Keep up the good work. Eclecticology 09:41, 18 Sep 2003 (UTC)
Hi, Nanshu: I added some comments and questions on the Entries on Chinese characters meta page...basically, I was wondering where the Hangeul index came from (I'm wondering how complete or accurate it is); also, I was wrong about using the word "Hum-eun": it should be "eumhun" (音訓), since that is the word in Korean. Finally, you mentioned the NK/SK spelling variation (롱/농), but this is tricky: in South Korea, when a character that starts with ㄹ is used on its own or at the beginning of a word, it is replaced with a ㄴ (or ㅇ if ㅑ, ㅕ, ㅛ, ㅠ, or sometimes ㅣ follows it. (Examples: 羅 (라) in 羅州 나주; the family name 李 (리), which is usually written & pronounced 이.) But when the character comes in the middle or at the end of a word, the ㄹ is maintained (so a South Korean would write 용 (龍) for "dragon," but 백룡 (白龍) for "white dragon"). This rule also applies to the eumhun reading of Hanja: South Koreans normally write 육 for 六, but Chinese character dictiories call the character "여섯 륙" (yeoseon-nyuk). For 용, the eumhun reading is "용 룡" (yong-nyong)! In North Korea, however, the ㄹ is always used, even at the beginning of a word--so a North Korean would spell the South Korean city name 나주 as 라주, and romanize it as Raju (like Rajin near the Russian border), instead of Naju. This consonant-changing rule also applies to ㄴ when it comes before ㅑ, ㅕ, ㅛ, ㅠ: for example, 녀 (女), which is spelled and pronounced 여 in 여자 (女子).
...I'm also going to post what I just wrote on the Entries on Chinese characters meta page because it's relevant to the discussion, but I'll leave it here for you as well. --Sewing 16:15, 2 Oct 2003 (UTC)
--Nanshu 22:56, 2 Oct 2003 (UTC)
Now your bot seems to be adding only the Korean section and not the Chinese or Japanese sections; is this a bug? (See 怾 for an example.) --Sewing 00:47, 3 Oct 2003 (UTC)
Sorry...I was going to say that the bot still seemed to be buggy, but I guess it only adds a section for each language if there is data for that language...is that correct? --Sewing 02:14, 4 Oct 2003 (UTC)
Congratulations for getting throught all those CJK characters. Great job. Eclecticology 02:43, 22 Nov 2003 (UTC)
Hi Nanshu,
I hope you are still around. I'm struggling to get my PHP bot script operational again. Apparently the new software expects a different kind of interaction than before. Could you share with me how Nanshubot interacts with the wiktionary server software? Many thanks,Polyglot 22:14, 19 Dec 2003 (UTC)
Hello. NanshuBot is a POST-only bot. It isn't clever enough to modify existing articles. Sorry if I cannot help you.
My bot is written in Perl. It sends the following request.
my $request = POST("http://wiktionary.org/w/wiki.phtml?title=$url_name&action=submit", { 'wpTextbox1' => $output, #'wpMinoredit' => 1, # debug 'wpSummary' => $comment, # 'wpPreview' => 'Show preview', 'wpSave' => 'Save page', 'wpSection' => '', 'wpEdittime' => '' }); $request->header('Cookie' => 'wiktionarywikiUserName=NanshuBot;wiktionarywikiUserID=421;wiktionarywikiPassword=********************************'); # wiktionary
This only works for non-existent articles. In this case, the server returns "Redirect". Otherwise, my bot updates the local error log (and I updates the articles manually). To edit an existing article, 'wpEdittime' should be set correctly. My bot with manually set 'wpEdittime' successfully updated User:Nanshu/Sandbox. --Nanshu 05:34, 20 Dec 2003 (UTC)
Hello Nanshu. Very nice work. I'm willing to link my little Zhendic on your useful wiktionary entries. I have few usability and readability suggestions.
The character displays in bold in entry title. On my computer, it's not very easy to read for some characters with many strokes. I'm not sure that it's possible to change this H1 layout. So I may suggest to add a first line with info readers would want to find quickly, something like a summary of the page. It could be like this:
I don't know if you plan to add multiple characters "words", but it could be useful also. I can send you the database I used (it is public domain, I think).
If you wish to add very valuable meat to English definitions, especially from sinological point of view, one may ask to those who made http://kongzi.arthum.hawaii.edu:8080
This gives you something like that:
By the way, your work is very fine and your bot very friendly :) 202.106.182.176 10:54, 8 Jan 2004 (UTC) (gbog on wikipedia)
I know it's a bit late, but I've added three new Chinese indices:
and I'm wondering if you could help populating these indices.
I'm also slightly improving/redoing the TOCs in these indices, for ease of navigation.
KelvSYC 04:25, 5 May 2004 (UTC)
Nanshu, I should have written this many months ago: thank you so much for the CJK Unified Ideographs pages you completed with Nanshubot! They became a very important tool for me when surfing in the net. I have altogether saved long amounts of hours using the wiktionary ideographs in here. Great idea and excellent work! Congratulations! - Piolinfax 14:39, 18 Jul 2004 (UTC)