Category Archives: voice

Touch Thoughts: Apple's Handheld Strategy

I’m still on the RDF.
Apple’s March 6, 2008 event was about enterprise and development support for its iPhone and iPod touch lines of handheld devices. Lots to think about.

(For convenience’s sake, I’ll lump together the iPod touch and the iPhone under the name “Touch,” which seems consistent with Apple’s “Cocoa Touch.”)

Been reading a fair bit about this event. Interesting reactions across the board.

My own thoughts on the whole thing.
I appreciate the fact that Phil Schiller began the “enterprise” section of the event with comments about a university. Though universities need not be run like profit-hungry corporations, linking Apple’s long-standing educational focus with its newly invigorated enterprise focus makes sense. And I had a brief drift-off moment as I was thinking about Touch products in educational contexts.

I’m surprised at how enthusiastic I get about the enterprise features. Suddenly, I can see Microsoft’s Exchange make sense.

I get the clear impression that even more things will come into place at the end of June than has been said by Apple. Possibly new Touch models or lines. Probably the famous 3G iPhone. Apple-released apps. Renewed emphasis on server technology (XServe, Mac OS X Server, XSan…). New home WiFi products (AirPort, Time Capsule, Apple TV…). New partnerships. Cool VC-funded startups. New features on the less aptly named “iTunes” store.

Though it was obvious already, the accelerometer is an important feature. It seems especially well-adapted to games, and casual gamers like myself are likely to enjoy the games this feature makes possible. It can also lead to very interesting applications. In fact, the “Etch and Sketch” demo was rather convincing as a display of some core Touch features. These are exactly the features which help sell products.
Actually, I enjoyed the “wow factor” of the event’s demos. I’m convinced that it will energize developers and administrators, whether or not they plan on using Touch products. Some components of Apple’s Touch strategy are exciting enough that the more problematic aspects of this strategy may matter a bit less. Those of us dreaming about Android, OpenMoko, or even a revived NewtonOS can still find things to get inspired by in Apple’s roadmap.

What’s to come, apart from what was announced? No idea. But I do daydream about all of this.
I’m especially interested in the idea of Apple Touch as “mainstream, WiFi, mobile platform.” There’s a lot of potential for Apple-designed, WiFi-enabled handhelds. Whether or not they include a cellphone.
At this point, Apple only makes five models of Touch products: three iPod touches and two iPhones. Flash memory is the main differentiating factor within a line. It makes it relatively easy to decide which device to get but some product diversity could be interesting. While some people expect/hope that Apple will release radically new form factors for Touch devices (e.g., a tablet subnotebook), it’s quite likely that other features will help distinguish Apple’s Touch hardware.
Among features I’d like to get through software, add-ons, or included in a Touch product? Number of things, some alluded to in the “categories” for this post. Some of these I had already posted.

  • Quality audio recording (to make it the ideal fieldwork audio tool).
  • eBook support (to compete with Amazon’s Kindle).
  • Voice support (including continuous dictation, voice interface…).
  • Enhanced support for podcasting (interacting with podcasts, sending audio/video responses…).
  • Video conferencing (been thinking about this for a while).
  • GPS (location will be big).
  • Mesh networking (a neat feature of OLPC’s XO).
  • Mobile WiMAX (unlikely, but it could be neat).
  • Battery pack (especially for long trips in remote regions).
  • Add-on flash memory (unlikely, but it could be useful, especially for backup).
  • Offline storage of online content (likely, but worth noting).
  • Inexpensive model (especially for “emerging markets”).
  • Access to 3G data networks without cellular “voice plan” (unlikely, but worth a shot).
  • Alternative input methods (MessagEase, Graffiti, adaptive keyboard, speech recognition…).
  • Use as Mac OS X “host” (kind of like a user partition).
  • Bluetooth/WiFi data transfer (no need for cables and docks).
  • MacBook Touch (unlikely, especially with MacBook Air, but it could be fun).
  • Automatic cell to VoIP-over-WiFi switching (saving cell minutes).

Of course, there are many obvious ones which will likely be implemented in software. I’m already impressed by the Omni Group’s pledge to develop a Touch version of their flagship GTD app.

Free As In Beer: The Case for No-Cost Software

To summarize the situation:

  1. Most of the software for which I paid a fee, I don’t really use.
  2. Most of the software I really use, I haven’t paid a dime for.
  3. I really like no-cost software.
  4. You might want to call me “cheap” but, if you’re developing “consumer software,” you may need to pay attention to the way people like me think about software.

No, I’m not talking about piracy. Piracy is wrong on a very practical level (not to mention legal and moral issues). Piracy and anti-piracy protection are in a dynamic that I don’t particularly enjoy. In some ways, forms of piracy are “ruining it for everyone.” So this isn’t about pirated software.

I’m not talking about “Free/Libre/Open Source Software” (FLOSS) either. I tend to relate to some of the views held by advocates of “Free as in Speech” or “Open” developments but I’ve had issues with FLOSS projects, in the past. I will gladly support FLOSS in my own ways but, to be honest, I ended up losing interest in some of the most promising projects out there. Not saying they’re not worth it. After all, I do rely on many of those projects. But in talking about “no-cost software,” I’m not talking about Free, Libre, or Open Source development. At least, not directly.

Basically, I was thinking about the complex equation which, for any computer user, determines the cash value of a software application. Most of the time, this equation is somehow skewed. And I end up frustrated when I pay for software and almost giddy when I find good no-cost software.

An old but representative example of my cost-software frustration: QuickTime Pro. I paid for it a number of years ago, in preparation for a fieldwork trip. It seemed like a reasonable thing to do, especially given the fact that I was going to manipulate media files. When QuickTime was updated, my license stopped working. I was basically never able to use the QuickTime Pro features. And while it’s not a huge amount of money, the frustration of having paid for something I really didn’t need left me surprisingly bitter. It was a bad decision at that time so I’m now less likely to buy software unless I really need it and I really know how I will use it.

There’s an interesting exception to my frustration with cost-software: OmniOutliner (OO). I paid for it and have used it extensively for years. When I was “forced” to switch to Windows XP, OO was possibly the piece of software I missed the most from Mac OS X. And as soon as I was able to come back to the Mac, it’s one of the first applications I installed. But, and this is probably an important indicator, I don’t really use it anymore. Not because it lacks features I found elsewhere. But because I’ve had to adapt my workflow to OO-less conditions. I still wish there were an excellent cross-platform outliner for my needs. And, no, Microsoft OneNote isn’t it.

Now, I may not be a typical user. If the term weren’t so self-aggrandizing, I’d probably call myself a “Power User.” And, as I keep saying, I am not a coder. Therefore, I’m neither the prototypical “end user” nor the stereotypical “code monkey.” I’m just someone spending inordinate amounts of time in front of computers.

One dimension of my computer behavior which probably does put me in a special niche is that I tend to like trying out new things. Even more specifically, I tend to get overly enthusiastic about computer technology to then become disillusioned by said technology. Call me a “dreamer,” if you will. Call me “naïve.” Actually, “you can call me anything you want.” Just don’t call me to sell me things. 😉

Speaking of pressure sales. In a way, if I had truckloads of money, I might be a good target for software sales. But I’d be the most demanding user ever. I’d require things to work exactly like I expect them to work. I’d be exactly what I never am in real life: a dictator.

So I’m better off as a user of no-cost software.

I still end up making feature requests, on occasion. Especially with Open Source and other open development projects. Some developers might think I’m just complaining as I’m not contributing to the code base or offering solutions to a specific usage problem. Eh.

Going back to no-cost software. The advantage isn’t really that we, users, spend less money on the software distribution itself. It’s that we don’t really need to select the perfect software solution. We can just make do with what we have. Which is a huge “value-add proposition” in terms of computer technology, as counter-intuitive as this may sound to some people.

To break down a few no-cost options.

  • Software that came with your computer. With an Eee PC, iPhone, XO, or Mac, it’s actually an important part of the complete computing experience. Sure, there are always ways to expand the software offering. But the included software may become a big part of the deal. After all, the possibilities are already endless. Especially if you have ubiquitous Internet access.
  • Software which comes through a volume license agreement. This often works for Microsoft software, at least at large educational institutions. Even if you don’t like it so much, you end up using Microsoft Office because you have it on your computer for free and it does most of the things you want to do.
  • Software coming with a plan or paid service. Including software given by ISPs. These tend not to be “worth it.” Yet the principle (or “business model,” depending on which end of the deal you’re on) isn’t so silly. You already pay for a plan of some kind, you might as well get everything you need from that plan. Nobody (not even AT&T) has done it yet in such a way that it would be to everyone’s advantage. But it’s worth a thought.
  • “Webware” and other online applications. Call it “cloud computing” if you will (it was a buzzphrase, a few days ago). And it changes a lot of things. Not only does it simplify things like backup and migration, but it often makes for a seamless computer experience. When it works really well, the browser effectively disappears and you just work in a comfortable environment where everything you need (content, tools) is “just there.” This category is growing rather rapidly at this point but many tech enthusiasts were predicting its success a number of years ago. Typical forecasting, I guess.
  • Light/demo versions. These are actually less common than they once were, especially in terms of feature differentiation. Sure, you may still play the first few levels of a game in demo version and some “express” or “lite” versions of software are still distributed for free as teaser versions of more complete software. But, like the shareware model, demo and light software may seem to have become much less prominent a part of the typical computer user’s life than just a few years ago.
  • Software coming from online services. I’m mostly thinking about Skype but it’s a software category which would include any program with a desktop component (a “download”) and an online component, typically involving some kind of individual account (free or paid). Part subscription model, part “Webware companion.” Most of Google’s software would qualify (Sketchup, Google Earth…). If the associated “retail software” were free, I wouldn’t hesitate to put WoW in this category.
  • Actual “freeware.” Much freeware could be included in other categories but there’s still an idea of a “freebie,” in software terms. Sometimes, said freeware is distributed in view of getting people’s attention. Sometimes the freeware is just the result of a developer “scratching her/his own itch.” Sometimes it comes from lapsed shareware or even lapsed commercial software. Sometimes it’s “donationware” disguised as freeware. But, if only because there’s a “freeware” category in most software catalogs, this type of no-cost software needs to be mentioned.
  • “Free/Libre/Open Source Software.” Sure, I said earlier this was not what I was really talking about. But that was then and this is now. 😉 Besides, some of the most useful pieces of software I use do come from Free Software or Open Source. Mozilla Firefox is probably the best example. But there are many other worthy programs out there, including BibDesk, TeXShop, and FreeCiv. Though, to be honest, Firefox and Flock are probably the ones I use the most.
  • Pirated software (aka “warez”). While software piracy can technically let some users avoid the cost of purchasing a piece of software, the concept is directly tied with commercial software licenses. (It’s probably not piracy if the software distribution is meant to be open.) Sure, pirates “subvert” the licensing system for commercial software. But the software category isn’t “no-cost.” To me, there’s even a kind of “transaction cost” involved in the piracy. So even if the legal and ethical issues weren’t enough to exclude pirated software from my list of no-cost software options, the very practicalities of piracy put pirated software in the costly column, not in the “no-cost” one.

With all but the last category, I end up with most (but not all) of the software solutions I need. In fact, there are ways in which I’m better served now with no-cost software than I have ever been with paid software. I should probably make a list of these, at some point, but I don’t feel like it.

I mostly felt like assessing my needs, as a computer user. And though there always are many things I wish I could do but currently can’t, I must admit that I don’t really see the need to pay for much software.

Still… What I feel I need, here, is the “ultimate device.” It could be handheld. But I’m mostly thinking about a way to get ideas into a computer-friendly format. A broad set of issues about a very basic thing.

The spark for this blog entry was a reflection about dictation software. Not only have I been interested in speech technology for quite a while but I still bet that speech (recognition/dictation and “text-to-speech”) can become the killer app. I just think that speech hasn’t “come true.” It’s there, some people use it, the societal acceptance for it is likely (given cellphone penetration most anywhere). But its moment hasn’t yet come.

No-cost “text-to-speech” (TTS) software solutions do exist but are rather impractical. In the mid-1990s, I spent fifteen months doing speech analysis for a TTS research project in Switzerland. One of the best periods in my life. Yet, my enthusiasm for current TTS systems has been dampened. I wish I could be passionate about TTS and other speech technology again. Maybe the reason I’m not is that we don’t have a “voice desktop,” yet. But, for this voice desktop (voicetop?) to happen, we need high quality, continuous speech recognition. IOW, we need a “personal dictation device.” So, my latest 2008 prediction: we will get a voice device (smartphone?) which adapts to our voices and does very efficient and very accurate transcription of our speech. (A correlated prediction: people will complain about speech technology for a while before getting used to the continuous stream of public soliloquy.)

Dictation software is typically quite costly and complicated. Most users don’t see a need for dictation software so they don’t see a need for speech technology in computing. Though I keep thinking that speech could improve my computing life, I’ve never purchased a speech processing package. Like OCR (which is also dominated by Nuance, these days) it seems to be the kind of thing which could be useful to everyone but ends up being limited to “vertical markets.” (As it so happens, I did end up buying an OCR program at some point and kept hoping my life would improve as the result of being able to transform hardcopies into searchable files. But I almost never used OCR, so my frustration with cost-software continues.)

Ah, well…

Voice and Empathy

Full disclosure. I do surveys. On the phone. For a marketing research firm.

No, no! Not a telemarketing firm! A research firm which uses survey results to improve the quality of the service offered by a client. Huge difference.

No, you most likely have not hung up on me. Very few people have done so and the readership of this blog is not such that it would be even remotely likely that you, dear reader, could be one of those few respondents who did hang up on me.

Why do I do it? Well, yes, it’s a job. A summer job, to be precise. But I could be doing (and have been doing) any number of other jobs. Yet, as an ethnographer, I felt compelled to give surveys a try. And I’m glad I did.

I actually did phone surveys as a summer job in 2005. Did it for the very reason that, while teaching ethnographic topics, I had been comparing ethnography with surveys even though I had never done surveys myself. Doing surveys on the phone seemed like a great way to learn more about those methods while getting an income at the same time. It worked like a charm.

Seems like I’m not the only one to think along those lines as I know at least two other anthropologists who are working at phone survey centres.

How do I like it? It’s really not so bad. The call centre where I work has a relatively nice atmosphere. More specifically, the supervisor and monitor provide exactly the type of supervision we need. Lots of positive feedback. Negative feedback is always given in a thoughtful manner. Both are very understanding and trusting with people who are serious at what they do. And there’s actually a notion of teamwork instead of competition.

I also learn a lot about myself. Not completely new things. Validation of what I thought of myself.

One is voice. My voice happens to be a valuable tool. Oh, I did notice this before. When I was in high school, some people kept telling me that I should become a news anchor or radio announcer because of my voice. The fact that I still had more of a European accent probably counted but it also had to do with actual voice quality. People thought I had a radio voice.

As shallow as it sounds, I do like my speaking voice. Not that it’s “the best voice ever” or that people stop me to tell me about my voice. But I do like the way I sound, overall. My voice used to be more pleasing than it is now. My GERD has had some detrimental effects on my voice. Especially my singing voice. But my voice is still pleasing enough that I receive positive feedback about it, on occasion.

The thing about my voice isn’t that it’s so good. But it’s a versatile voice and I do use it as a tool. It seems that I can adapt it to different situations, which is very useful.

Given my interests in acoustic anthropology, it should be no surprise that I think about voice fairly frequently. After all, I’m an audio guy. Like Steven Feld in Music Grooves, I wonder about the voice work of those women working for erotic phone lines. It would, in fact, be fascinating to do an ethnographic study of those workers, with a focus on voice work.

As anyone can guess, voice can also be quite important in teaching. I’m as much of an auditory learner as one can be. So, while teaching, I tend to use my voice for effect instead of other tools. It seems to work rather well with some people but I need to enhance my other teaching methods.

The other main thing doing phone surveys has taught me about myself is how empathetic I can get. Again, I knew this beforehand. I’m the kind of person who has a hard time watching a comedy about someone getting in all sorts of bad situations (“cringe” movies and such). I literally feel for them. When I watched The Sixth Sense, I felt the bullet enter my body.

Oh, sure. We’re all like that. But I get the feeling that my empathy levels are a bit extreme, at times.

Hannah Arendt would probably have said some negative things about this “personality trait” of mine. But I’ve learned to accept it.

What does this have to do with doing surveys on the phone? Quite a bit, actually. There are projects on which I can be very productive, mostly because of empathy. People hear that I care. Because I do care. A few other projects, I’m almost unable to do because of empathy. I need to get the feeling that those surveys can actually help improve the service people get. And I loathe being annoying to people.

On almost every survey I do at my current workplace, I can be very empathetic and it works very well. But I just worked on a project which was clearly annoying to respondents and it made me shrivel. The effect was quite intense. I had to take a long walk on my way back from work because I had realised something important about myself.

Hence this blog entry.

iPhone Wishlist

Yeah, everybody’s been talking about the iPhone. It’s last week’s story but it can still generate a fair bit of coverage. People are already thinking about the next models.

Apple has most of the technology to build what would be my dream handheld device but the iPhone isn’t it. Yet.

My wishful thinking for what could in fact be the coolest handheld ever. Of course, the device should have the most often discussed features which the iPhone currently misses (Flash, MMS, chat…). But I’m going much further, here.

  • Good quality audio recording (as with the recording add-ons for the iPod 5G).
  • Disk space (say, 80GB).
  • VoIP support (Skype or other, but as compatible as possible).
  • Video camera which can face the user (for videoconference).
  • Full voice interface: speech recognition and text-to-speech for dialing, commands, and text.
  • Handwriting recognition.
  • Stylus support.
  • Data transfer over Bluetooth.
  • TextEdit.
  • Adaptive technology for word recognition.
  • Not tied to cellular provider contract.
  • UMA Cell-to-WiFi (unlicensed mobile access).
  • GPS.
  • iLife support.
  • Sync with Mac OS X and Windows.
  • Truly international cellular coverage.
  • Outliner.
  • iWork support.
  • Disk mode.
  • Multilingual support.
  • Use as home account on Mac OS X “host.”
  • FrontRow.
  • USB and Bluetooth printing.
  • Battery packs with standard batteries.

The key point here isn’t that the iPhone should be a mix between an iPod and a MacBook. I’m mostly thinking about the fact that the “Personal” part of the “PC” and “PDA” concepts has not come to fruition yet. Sure, your PC account has your preferences and some personal data. Your PDA contains your contacts and to-do lists. But you still end up with personal data in different places. Hence the need for Web apps. As we all know, web apps are quite useful but there’s still room for standalone applications, especially on a handheld. It wouldn’t take much for the iPhone to be the ideal tool to serve as a “universal home” where a user can edit and output files. To a musician or podcaster, it could become the ideal portable studio.

But where the logical step needs to be taken is in “personalization.” Apparently, the iPhone’s predictive keyboard doesn’t even learn from the user’s input. Since the iPhone is meant to be used by a single individual, it seems quite strange that it does not, minimally, adapt to typed input. Yet with a device already containing a headset it seems to me that speech technologies could be ideal. Full-text continuous speech recognition already exists and what it requires is exactly what the iPhone could provide: adaptation to a user’s voice and speech patterns. Though it may be awkward for people to use a voice interface in public, cellphones have created a whole group of people who seem to be talking to themselves. 😉

Though very different from speech recognition, text-to-speech could integrate really well with a voice-driven device. Sharing the same “dictionaries” across all applications on the same device, the TTS and SR features could be trained very specifically to a given user. While screens have been important on computers for quite a while, voice-activated computers have been prominent in science-fiction for probably as long. The most common tasks done on computers (writing messages, making appointments, entering data, querying databases…) could all be done quite effectively through a voice interface. And the iPhone could easily serve as a voice interface for other computers.

Yes, I’m nightdreaming. It’s a good way to get some rest.

Professors and Online Ethnography

Fellow anthropologist Michael Wesch (of The Machine Is Us/ing Us fame) posted about a video that The Chronicle of Higher Education has released about his own digital ethnography projects.

For those who don’t know, The Chronicle is a well-known U.S. publication aimed primarily at university and college professors. It contains news and job announcements irrespective of disciplinary boundaries. A bit like the CAUT/ACPPU Bulletin here in Canada.

The video itself is journalistic in tone and does pay lip service to the challenges of online research. I like the fact that we get to hear one of Wesch’s students, known as ThePoasm on YouTube. But, overall, the video does little to give voice to the people involved, apart from Wesch himself. The lack of student focus is unsurprising as The Chronicle is mostly concerned with faculty members. But there could have been more talk about the academic, disciplinary, institutional, and pedagogical implications of Wesch’s projects.

Maybe I’m just jealous of Wesch for being able to undertake those projects in the first place. Anyone want to podcast/vidcast with me? 😉

Googely Voice

Neat new service.

GOOG-411 offers free directory assistance – Lifehacker

Not available in Montreal, but quite useful. Apparently better than Free-411.

The speech recognition and speech synthesis are quite good. In fact, when I was working in speech, such a service was pretty much the main example we used for the need for speech research. With the prominence of cellphones in many different parts of the world, I still think that speech is a field in which technological advancements can have very interesting effects.

iRiver H120 (Digital Audio Jukebox)

Recently purchased a brand new iRiver H120 with remote control on eBay from OutletMP3. Paid 132.50$ plus 18$ shipping. Also purchased a 3-year warranty through SquareTrade for 16$.
Item arrived as described, with both the European power adapter (in the original box) and a North American power adapter (in the shipping box). The remote control is included in the package but is outside of the original box. OutletMP3 sells those iRiver H120 devices with or without remote control (usually at about the same price).
Yes. “Would do business with OutletMP3 again.” (As it turns out, they sell iRiver products quite frequently on eBay and they have an eBay store with “Buy It Now” iRiver H120 devices without remote for 150$ each.)
The best things about this device are its recording features. Those iRiver H1x0 models can record uncompressed sound in WAV format at 16bit with a sampling rate of 48 kHz (so-called “DAT quality”), 44.1 kHz (so-called “CD-quality”), or lower (“FM-quality,” “voice quality”). It also records directly to MP3 files (with the official firmware) in a variety of encoding settings (up to 320 kbps). It has an internal microphone for voice dictation as well as an input for external microphone, analog line in, or optical in.
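
To put those recording settings in perspective, here is a rough sketch of the data rates involved. It assumes two-channel (stereo) recording, which the specs above don't spell out, and the function name is just an illustration, not anything from iRiver.

```python
def pcm_kbps(sample_rate_hz, bit_depth=16, channels=2):
    """Data rate of uncompressed PCM audio, in kilobits per second.

    channels=2 (stereo) is an assumption made for this sketch.
    """
    return sample_rate_hz * bit_depth * channels / 1000


dat = pcm_kbps(48_000)   # so-called "DAT quality"
cd = pcm_kbps(44_100)    # so-called "CD quality"
mp3_max = 320            # highest MP3 setting mentioned above, in kbps

print(f"DAT-quality WAV: {dat:.1f} kbps")
print(f"CD-quality WAV:  {cd:.1f} kbps")
print(f"CD-quality WAV takes about {cd / mp3_max:.1f}x the space of 320 kbps MP3")
```

In other words, even the highest MP3 setting should take less than a quarter of the space of an uncompressed recording, which matters on a long recording session.
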
The box includes a surprisingly decent lavaliere-style monophonic microphone. Not an excellent microphone in any way but clearly better than one might expect (though Laith Ulaby had told me that this microphone was decent).

In terms of operation, the unit has some strengths. The overall interface is much less convenient than that of the iPod, say, but the battery lasts longer than most iPods (for playback). The iRiver H120’s remote has a small LCD screen which shows enough information for most needs, making it possible for me to keep the H120 in my pants pocket and operate the device with the remote. While, among portable players, only the iPod has native support for AAC and lossless formats, iRiver players support Ogg Vorbis and WMA. Haven’t done anything in Ogg format yet but it might be an interesting option (though it does make files less compatible with other players).

Apart from navigation and interface, the main differences with my previous iPod 2G have to do with iTunes integration. The iPod’s synchronization with iTunes made it rather convenient to create and update playlists or to transfer podcasts. iRiver’s models may not be used in the same fashion. However, the iRiver H120 can in fact be used with iTunes through a plugin meant for Archos players. Unfortunately, this plugin seems to have some problems with a few files (probably because of invalid characters like ‘/’ and ‘:’ in filenames), generates non-working playlists on Mac OS X, and puts all files in an “Artist/Album” hierarchy which makes iRiver navigation more complicated.
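
If invalid characters really are the culprit (a guess, as noted above), one workaround would be to clean up filenames before copying them to the device. The following is only a minimal sketch: the `safe_name` helper is hypothetical, and the character list beyond ‘/’ and ‘:’ is an assumption based on characters that commonly cause trouble on FAT-formatted drives.

```python
import re

# '/' and ':' are the characters suspected above; the others are an
# assumption (common offenders on FAT-formatted volumes).
INVALID = re.compile(r'[/:\\*?"<>|]')


def safe_name(name, replacement="-"):
    """Return a copy of name with problematic characters replaced."""
    return INVALID.sub(replacement, name).strip()


print(safe_name("AC/DC: Back in Black.mp3"))  # AC-DC- Back in Black.mp3
```

This only handles a single file or folder name, not a full path, since the path separator itself is one of the characters being replaced.
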

What surprised me somewhat was that the H120, a USB 2.0 device, works perfectly well with my old iBook (Dual USB) which only has USB 1.1 ports. No need for special drivers and the device then works pretty much like a (20GB) USB drive. Since the iRiver H120 works as a USB drive, it’s easy to transfer files to and from the device (contrary to the iPod, which makes this somewhat more difficult). All audio files can be put at the root level on the iRiver and audio recordings made on the iRiver are in the “RECORD” folder at the root level of the drive. While the iBook’s USB 1.1 ports are much slower than USB 2.0 ones, they do the job well enough for my needs. (Will be going back to my entry-level emachines H3070 in a few days.) A 400 MB file recorded on the iRiver (about 40 minutes of 16 bit stereo sound at 44.1 kHz) transferred to the iBook through USB 1.1 in less than ten minutes. Slow, but bearable. My old iPod used a Firewire 400 (aka IEEE 1394 or i.Link) connection which is about the same speed as USB 2.0 in most conditions. My entry-level emachines desktop has both USB 2.0 and Firewire 400 ports (thanks to an inexpensive Firewire card).
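
Those numbers hold up to a back-of-the-envelope check. The sketch below ignores the WAV header and uses USB 1.1's nominal 12 Mbit/s signalling rate as a best case; real-world throughput is lower, which fits the "less than ten minutes" observed. The function names are made up for the illustration.

```python
BYTES_PER_SAMPLE = 2      # 16-bit audio
CHANNELS = 2              # stereo
SAMPLE_RATE = 44_100      # Hz


def wav_bytes(minutes):
    """Approximate size of the uncompressed audio data (header ignored)."""
    return SAMPLE_RATE * BYTES_PER_SAMPLE * CHANNELS * minutes * 60


def transfer_minutes(size_bytes, megabits_per_second):
    """Best-case transfer time at a given nominal signalling rate."""
    bytes_per_second = megabits_per_second * 1_000_000 / 8
    return size_bytes / bytes_per_second / 60


size = wav_bytes(40)
print(f"40 minutes of audio: {size / 1_000_000:.0f} MB")
print(f"USB 1.1 at 12 Mbit/s: {transfer_minutes(size, 12):.1f} minutes at best")
```

So 40 minutes of CD-quality stereo comes to a bit over 420 MB (roughly 400 MB if counted in binary megabytes), and the theoretical minimum over USB 1.1 is a little under five minutes.
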

Was thinking about putting Rockbox on the H120 but SquareTrade tells me that it may void their warranty, which would be an inconvenience. Rockbox has some neat features and seems safe enough to use on “production machines,” but its features aren’t that compelling for me at this point.
The H120 has a radio (FM) tuner, which could be useful to some people but isn’t really a compelling feature for me. Haven’t listened to much radio in the past several years. Podcasts are soooo much better!

Speaking of podcasts… One of my reasons for purchasing this machine (instead of a more recent iPod) was the ease of recording. This is clearly not a professional recording device but the sound quality seems quite decent for my needs at this point. Should be using it to record lectures and distribute them as podcasts or “lecturecasts” (yeah, ugly name, sorry!). In my mind, educational podcasting can supplement lectures quite nicely. Have been to a few workshops and presentations on technology use in teaching and most people seem to agree that technology is no replacement for good pedagogy but that good pedagogy can be supplemented and complemented (if not complimented!) by interesting tools. Had been thinking about a recording iPod to integrate podcasts with course material. It would have been quite useful, especially in connection with iLife and iWork. But an iPod 5G (with video) is already much more expensive than my iRiver H120 and the add-ons to enable 44.1 kHz / 16 bit recording on the iPod are only now getting to market at a price almost half that of my brand new iRiver H120. Plus, though the iPod is well-integrated with iTunes on Windows, iLife and iWork applications are only available on Mac OS X 10.4 and, thus, will not run on the entry-level emachines H3070 which will become my primary machine again in a few days.
In other words, my ideal podcasting/lecturecasting solution is out of my reach at this point. And contrary to tenure-track faculty, lecturers and adjunct faculty get no technology budget for their own use.
Ah, well…

Still, my iRiver H120 will work fine as a recorder. Already did a few essays with voice and environmental sounds. The lavaliere microphone was quite convenient to record myself while taking a walk which sounds like an unusual activity but was in fact quite relaxing and rather pleasant. In terms of environmental sounds, the same microphone picked up a number of bird songs (as well as fan noises).
Among the things that distinguish the H120 from a professional recorder is the lack of a proper calibration mechanism. It’s not possible to adjust the recording levels of the two channels independently and it’s not even possible to adjust volume during recording. (There’s a guide offering some guidance on how to work within those constraints.) Quite unsurprisingly (for what is mostly an MP3 player) but also making the device less of a professional device, its jacks are 3.5 mm “stereo mini-plugs” (instead of, say, XLR jacks). For that matter, the iRiver H120 compares favourably to several comparably-priced MiniDisc recorders, even Hi-MD models. Did field research with a used ATRAC 4.0 MiniDisc recorder. That setup worked somewhat adequately but this iRiver H120 is quite an improvement for me.

Got a few pet peeves about the iRiver H120. For instance, it has no actual clock so recorded files do not carry a timestamp. A minor quibble, of course, but it would have been useful. The overall navigation is as awkward as that of my first MP3 device, the RioVolt (which also used iRiver firmware). One navigational issue is that navigating up and down in the folder hierarchy is done through the stop and play buttons instead of, say, using one of the three jog switches on the remote. Some functions only work when the device is stopped while others work while it’s playing. Switching from hard-disk playing to recording or to FM is a bit awkward and cumbersome. The unit takes a while to turn on and doesn’t really have a convenient sleep mode. While it is possible to resume playing on a track that has been stopped, this feature seems not to work every time. Fast forwarding rate (“scan speed”) is set in a menu instead of being dynamic as on the iPod. The device doesn’t support ratings or, really, descriptions (although Rockbox might be able to support those).

Also got a few well-appreciated features, apart from those stated above. The EQ and SRS presets are appropriate and relatively easy to use. Contrary to the iPod 2G, it is possible to play files at a higher rate (increasing the “playback speed”), making it possible to listen to voice at a higher speech rate (and higher frequency). It’s also possible to delete files directly from the device.
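
The "higher frequency" part follows from simple resampling: if audio is just played faster, with no pitch correction (which is what the H120 appears to do, though that is an assumption here), every frequency is multiplied by the speed factor. A small sketch of how much higher voices end up, with illustrative speed values rather than the device's actual settings:

```python
import math


def pitch_shift_semitones(speed):
    """Pitch change, in semitones, when audio is simply played faster."""
    return 12 * math.log2(speed)


# Illustrative speed factors, not the H120's actual menu options.
for speed in (1.25, 1.5, 2.0):
    print(f"{speed}x speed: +{pitch_shift_semitones(speed):.1f} semitones")
```

At double speed, a voice would sound a full octave (12 semitones) higher.
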

At any rate, that’s already a long entry and experience with my H120 will probably push me to write more about the device.

Feel free to comment or send questions through email.