Japan

Audio cleanup?


While my Japanese class is going well so far (the pace is a bit slow, due in part to the overhead of community-college drop/add handling), I’ve found one serious annoyance: the audio CDs are crap.

There’s nothing wrong with the content; the material is presented clearly by native speakers, and the original mastering was well-done. Unfortunately, it was mastered for cassette tape, and the CDs were apparently converted from that format. How well was this done? Here’s a thousand words on the subject:

Situational Functional Japanese Audio CDs

That’s what it looked like when I loaded a track into Audacity. I can crank up the gain, but then the hiss becomes objectionable, and Audacity’s noise filter introduces some rather obnoxious artifacts, even at its gentlest setting. I’ll be using these CDs until March, so it’s worth a little time and money to me to get them cleaned up. Any recommendations for a good tool?

The cleansing power of Quartz


So my new Japanese class has started (lousy classroom, good teacher, reasonable textbook, nice group of students, unbelievably gorgeous teacher’s assistant (I will never skip class…)), and, as expected, the teacher is pushing us to master hiragana quickly. I did that quite a while ago, so while everyone else is trying to learn it from scratch, I can focus on improving my handwriting.

One thing she suggested was a set of flash cards. I had mine with me, and mentioned that they were available for a quite reasonable price at Kinokuniya. Her response was along the lines of “yes, I know, but nobody ever buys optional study materials; do you think you could photocopy them so I can make handouts?”

I could, but that wouldn’t be nearly as much fun as making my own set. The first step was finding a decent kana font. Mac OS X ships with several Unicode fonts that include the full Japanese kana and kanji sets, but they didn’t meet my needs: looking good at display sizes, and clearly showing the boundaries between strokes. I found Adobe Ryo Display Standard. TextEdit seems to be a bit confused about its line-height, but I wasn’t planning to create the cards in that app anyway.

How to generate the card images? Perl, of course, with the PDF::API2::Lite module. I could have written a script that calculated the correct size of cards to fill the page, but I was feeling lazy, so I wrote a 12-line script to put one large character in the middle of a page, loaded the results into Preview, set the print format to 16-up with a page border, and printed to a new PDF file. Instant flash cards.

For many people, this would be sufficient, but one of the things sensei liked about the cards I had brought was the numbers and arrows that indicated the correct stroke order. There was no lazy way to do this, so I used Adobe Acrobat’s drawing and stamping tools. The stamping tool lets you quickly decorate a PDF file with images in many formats, so I just modified my previous script to create PDF files containing single numbers, and imported them into the stamp tool. The line-drawing tool let me make arrows, although I couldn’t figure out a simple way to set my own line-width and have it remembered (1pt was too thin after the 16-up, and 2pt had too-big arrowheads).

So why is this post titled “the cleansing power of Quartz”? Because the one-per-page annotated output from Acrobat was more than six times larger than the same file printed 16-up from Preview. Just printing the original file back to PDF shrank it by a factor of four, which, coincidentally, is almost exactly what you get when you run gzip on it…

The final results are here.

Foothill Follies


I’ve made a lot of progress learning Japanese with the Rosetta Stone software, but I decided that it’s time to work on the area that it helps least with: speaking. Since I’m working in Palo Alto again, taking a class seemed like a good idea, and Foothill College seems to have a good program, with flexible hours. Sort of.

Their first-year class is offered in evenings, which will work great for me. The second-year class appears to only be available at 8am, three days a week, which is not so good. Even without taking into consideration the fact that I live 75 miles from the campus, I’m not much of a morning learner.

Worse, the Intermediate Conversation class that can be taken alongside the second-year class starts at 8pm on the same days. If this isn’t an error in the online course catalog, it’s pretty stupid. They don’t have the schedules for Winter and Spring online yet, so I suspect I’ll have to talk to someone in the department to find out if this is typical. If I can continue their program in the evenings, or even later in the morning, I will; otherwise, I may have to drop it after the first year.

History is written by the winners moonbats


While in the book store last week, I picked up A History of Japan, by Conrad Totman. I didn’t make it past the preface before the bullshit was too deep to wade through. Quoting:

Today we find ourselves at a point where the level of human exploitation of the ecosystem appears to be throwing the entire global biome into crisis. The Earth is now home to well over six billion people, but in fact this small planet's current biological production is not remotely capable of sustaining those people in the manner to which they are accustomed, much less the manner to which they aspire.

This, he says, is why he decided to write a book about Japanese history. Skimming ahead and checking the reviews, it appears his “ecological” approach to history taints the contents from cover to cover, coloring both which facts he chooses to include, and how he interprets them.

I have rarely felt the urge to return a book to the store based on its content, but a historian who so thoroughly injects his personal politics into the material simply isn’t worth reading.

6Sense, podcast edition


A Japanese-language online radio show I like, 6Sense, is published in an annoying way. They keep more than a month’s worth of archives online in MP3 format, but each episode is split into 60+ audio files, accessed through a Flash interface.

Examining the Flash app told me very little. Examining my Privoxy logs gave me the regular-but-unpredictable naming convention for the audio files, and a little more digging turned up the URL that the Flash app calls to get the list for a specific day. After that, I simply used wget to download the complete show… as 60+ MP3 files.

Knowing that someone had to have written a Perl script to concatenate MP3 files, I googled and found mp3cat, part of Johan Vroman’s mp3cut package. Making the results into a podcast required the use of another Perl script, podcastamatic, and a web server to host the results. I just turned on web sharing on my Mac, moved the files into ~/Sites, and typed the appropriate URL into iTunes.

With the latest version, iTunes supports podcasts directly, but the integration is kind of peculiar, and carries over to the iPods. Both correctly track what you’ve listened to, and where you left off in the middle of an episode, but otherwise they’re not treated like regular audio tracks.

In iTunes, if you finish listening to one episode of a podcast, instead of moving on to the next episode, it skips to the current episode of the next podcast. On iPods, there’s no concept of “next” at all; when a podcast ends, it just stops playing. If you’ve set it to repeat, it repeats the episode you just heard. Unfortunately, not all podcasts are an hour long; some are quite short, such as ナナライフ, which averages about 90 seconds.

Ironically, the least sophisticated iPod handles podcasts the best right now. The iPod Shuffle just treats them as sound files, and syncs up the play count when you connect it to your computer. When you delete an episode from iTunes, it’s deleted from your Shuffle. Not perfect, but better for long drives (and I’m driving 150 miles a day right now, as I settle in to my new job…).

Sake cheat card


After trying out a few types of sake and doing a little reading on the subject, I decided to gather up all of the useful information commonly printed on labels and menus, and arrange it on a double-sided 3x5 card. It was as much an excuse to play with the new version of Adobe Illustrator as anything else, but it should come in handy the next time I try to figure out what to buy at Mitsuwa.

It does this and, it has and they are for ten thousand goodwill palpus う.


[Note the correction in red; see the update section for details on my mistake]

This is, of course, a bad translation of written Japanese, created by the SYSTRAN query tool supplied with Mac OS X Tiger. The original phrase was 「このはしは、もちのれんしゅうです。」, and it comes from the items shown below (the small white piece reads 「はしのおけいこ」, by the way, which is also the name of the complete product):

hashi no okeiko

The correct translation is left as an exercise for the reader. :-)

Update: the correct transcription would help. I goofed. If you click on the picture, you’ll see that the eighth character is actually 方, not 万. This makes the sentence so easy to translate that even SYSTRAN produces something mostly comprehensible. (or not. I forgot that I’d converted れんしゅう into the correct kanji; without that, SYSTRAN’s effort is still pretty bad)

The reason I wrote the wrong kanji is that, as printed on the chopstick, it looks like three strokes rather than four, and the three-stroke kanji it most closely resembles is extremely common. I noticed the visual difference, but disregarded it when I couldn’t find a better three-stroke match.

Update: to make up for my error, here’s another pair of training chopsticks, that teach a different skill…

Training chopsticks

Practical Japanese Vocabulary


Today’s Rosetta Stone lesson could prove useful, in the right circumstances. Or perhaps it’s intended as a moral lesson…

perfectly innocent dialogue, innuendo inferred

Transcribed with pop-up furigana, the four sentences are:

彼女一口飲んでいます。
彼女はゴクゴク飲んでいます。
彼女吸っています。
彼女吹いています。

“Need a clue, take a clue,
 got a clue, leave a clue”