Skip to main content

How does a programming field linguist come into existence?

Now after 20 years of education it's been nearly 7 months since my last assignment was turned in and I have been released to do my work. Hence the absence of any sort of light reading or content creation since I discovered programming and linguistics in the same summer, the summer I turned 16. I could barely check into PhD Comics and the Language Log once every 2 years...

So you might ask, how does a programming field linguist become a programming field linguist, does it co-evolve, or is there no connection? I argue that the connection is tied to the root of the two disciplines. We could argue that the common ancestor is Chomsky, but I prefer to think in more abstract terms.

To better understand this cross-breed, let us explore this path in a particular individual. Individual c. finds out about comparitive linguistics and programming decentralized agent-based systems in the same summer at the University of Maine, Math and Science UpWard Bound Summer Program for high school kids. She decides to do her undergrad in linguistics at Concordia, where Dr. Hale gives her a RedHat CD, and years of green text and compiling dependancies fly by. Bending the rules she takes Field Methods her first year. With only Ling 101 and Phonology under her belt, she is confused and awed by the fourth-years who seem to ask the informant sentences that she would never have figured out. She starts practicing on her roommate (who hails from Lahore, Pakistan) gets the hang of it, writes a (kick-ass) final paper on Mongolian locatives and instrumentals and manages to be 1 of the three A+ in the course. She takes this as a sign that she has met her destiny... Dr. Reiss suggests she do her grad school at University of Delaware, where they have some pretty well known field linguists. She graduates, gets a job as a Linguist at a multinational Text To Speech and Automatic Speech Recognition company in Montreal on a 1 year work visa, learns Perl, and works until all linguists in the fishbowl are laid off due to a final push for profitability in the 4th quarter. She gets accepted for grad school, deferes for 1 year and moves to Pakistan again to gather data for her research. She goes to grad school, takes more field methods courses on two dialects of Malay, some Passamaquoddy and as well as on-site fieldwork in Romania, Korea, Austria and the Czech Republic. She gets her permanent residence and settles in Montreal where through Dr. Bergler, she finds out about GATE, she also finds out about Jurafsky and Martin (second edition), and Protege. Being (apparently) the only grad student in the Computer Science and Software Engineering department with Perl on her CV, she TA's two Software Engineering courses. To help out an old friend from undergrad she works as a Linguist durring the summer at a Knowledge Engineering Startup (bound by another non-disclosure agreement she won't tell you what she did). The GATE training comes to town and she really learns Eclipse and some practical Software Engineering. The flood gates open, she ditches her Perl and Python and PHP scripting in favor of Groovy so she can script GATE to her needs... and she learns how to program Android to make an interface for GATE pipelines, to auto-summarize her pdfs and make "audiobooks" that don't read try to read damn data tables or trees to her, and more importantly read with comprehensible prosody. It's a beginning, it took her a lot of reading and highlighting to get here, and there is still more to do, but the flood is on the horizon :)

Are there others working these two fields? The answer is as sure as the common root is sure: yes. Where are they? Very busy plowing and cultivating the rocks in their two fields, perhaps stamped out as weeds, or perhaps they have already moved on to more fertile ground...

Three cheers for Engineering and Infrastructure to connect the neighboring fields!


Popular posts from this blog

10.6.8 update spells Joy for Minimacs everywhere

If, after updating to 10.6.8 you get into a reboot loop, never fear the update is the same as every other update, except there is a step involving replacing the kernel.

This is very easy to do if you either (a) download it and save it on your Minimac before you update to 10.6.8, or (b) you have a mac formated USB key that you can copy it onto after your Minimac starts looping.

Here is the super-condensed minimal effort path to get you into Minimac heaven... (no not a dead Minimac, a running one), at least until Lion comes out.

On another computer (preferably a Mac or Ubuntu)
Download the legacy kernel[mirror]Put it on a Mac formated USB key * On the Reboot Looping Minimac
Hold down Shift as you bootAt the boot loader screen type (once you start typing it will apear in black letters on the bottom of the screen)  recovery=yes, -x Once it has finished loading, plug in the USB keyCopy the legacy_kernel-10.6.8.bz2 to your MinimacDouble click on it to unzip itMove the legacy_kernel-10.6.8 to …

English Noun Incorporation?

I was at a talk today with some Ojibwe data where invariably the claim that "English doesn't have incorporation" or at least incorporation of objects came up. We have "vacume clean" but generally we only incorporate the instrument. I remember a similar discussion coming up a few years ago in 2007 and I asked myself about apple picking. My colucators said, sure, but you can't say apple pick right? I thought about it a bit and came up with a linear string of words that might get google results. I remember I searched for "we apple picked" and found a few results, indicating to me that some people say it, generally when discussing their weekends. So, having my Android with me at the talk I googled again. This time I found a lot more examples than before, 394 to be exact, all of the first page clear examples with native speakers, speaking naturally.

I've heard this claim can be traced back to Baker 1988. When I got home I googled the claim "…