Wednesday, January 31, 2007

Digging Google Earth

I've played with Google Earth before now, but I'm digging it once again. I'm working on a piece about France's Savoie, and I wanted to get a sense of the area. No problem. Type in Chignin, France and you can see the village, the local lakes, and even the vineyards.

But then I discovered the tilt slider. Adjust it one way or the other, and Google Earth creates a 3D view of the terrain. It's one thing to see there are mountains in the Savoie. It's quite another to see how tall and steep the mountains are. The tool isn't a perfect view: I can't see Mont Blanc off in the distance. But still, it's way cool.

Lots of folks have written about Google Earth, and in particular the community aspects, where the internet's population tags items visible in Google Earth (including jets caught midflight and other ephemera). So I'm late to the game. But still, if you haven't pulled it down yet, take a look when you have some hours to kill.

Monday, January 29, 2007

Tycho Articulates a Problem With Scores

Everyone knows I dislike wine scores. Tycho, from Penny-Arcade, articulates one of the things I dislike about them, even if he's talking about video game reviews.

I often find that I agree with the bodytext of a given piece and then once I arrive at the distilled score there's no way to derive it from the original work. This is because while the body includes useful, interesting information and perspective the score is often used to cement the review in the overall editorial philosophy. Scores are ridiculous and facile, and they are insulting to the reader - they say in clear terms that you are incapable of discerning the meaning of the text above or below.

Thursday, January 25, 2007

Neologism: Who-do

I've decided that getting who and whom correct in a sentence should be called "who-do," which rhymes with "judo." Go forth and make it so.


These Are My People

I self-identify as a nerd, though I'm by no means as nerdy as other nerds I know. Still, I found this Nerd of the Week episode, about Penny Arcade Expo, touching and sympathetic.

Grammar Girl Profiled

CNN profiles Mignon Fogarty, aka Grammar Girl.

Wednesday, January 24, 2007

Eternity II

Christopher Monckton made puzzle news in 1999 when he released his Eternity puzzle, which was uninteresting in every way except for the million-pound prize attached to it. He confidently imagined that it would take purchasers years to solve it, only to have the solution (one of a few, probably, since he didn't design it for a unique solution) handed to him 18 months later.

He will soon be releasing Eternity II, a different puzzle, this time with a $2 million prize. It looks like a straightforward edge-matching problem, but who knows.

New Cliches For Old!

Melissa mentioned this defective yeti campaign to find new cliches for old standbys. Cliches are the bane of interesting writing, but not if you swap in new and exciting variants.

I'll put some thought into my favorites. What are yours?

Monday, January 22, 2007

Average Joe Heroes

Slate has an article and slide show about comics that look at daily lives with and around superpowers, a nod to the reemergence of NBC's Heroes after its holiday slumber. Living in Infamy intrigues me the most of their choices, though Astro City, also mentioned, has a large following.

Saturday, January 20, 2007

You're Know You Are

Josh of Comics Curmudgeon spotted a great example of grammatical nonsense in the wild. The Pluggers panel at the bottom of the page starts with "You're know you are really a plugger..." It's easy to see how this came about: Originally, it was "You're a plugger if" and then the writer thought about "You know you are" and no one caught the problem.

The panel's been fixed on the website, favoring the stilted "You know you are" construction instead of the shorter, but more informal, "You're a plugger if".


Thursday, January 18, 2007

Ze Frank Discusses His Wii

Videoblogger Ze Frank shows off his Wii, mentioning some games that I've not yet seen.

Wednesday, January 17, 2007

Niagara Power Station Spelunking

Spotted on the NPL mailing list: a trio goes spelunking into the bowels of an old hydroelectric power station at Niagara Falls and then writes up the adventure in detail, complete with pictures.

Note: strong language within.

How Do You Say "Dour"?

I discovered the "A Way With Words" radio show, and I listened to a recent episode as a podcast. One of the callers described losing a bet with her sister about the preferred pronunciation of the word "dour." I, like the caller, have always said it as a rhyme with "sour" or "hour." But the favored pronunciation rhymes with "lure." I should note that my laptop's dictionary lists the rhymes-with-hour pronunciation as an acceptable alternate, but rhymes-with-lure sits in first place.

How do you say "dour"?

Friday, January 12, 2007

Strunk & Whiteisms For The Day

In idle moments, I'm rereading Strunk & White's Elements of Style, and a couple of excerpts caught my eye this evening.

In their suggestion to prefer the standard to the offbeat when choosing words, they write: "By the time this paragraph sees print, psyched, nerd, ripoff, dude, geek, and funky will be the words of yesteryear." Oops.

And of the language of advertising, and its pervasive influence on culture: "You will also, in all probability, want to try writing that way, using that language. You do so at your peril, for it is the language of mutilation." That's my new favorite phrase: "the language of mutilation."

Who vs. Whom

A recent exercise in English 3200 (reviewed here) offered a great clarification of "whom" vs. "who." When either appears in a noun clause (a group of words with a subject and verb that acts as a noun in a sentence), you choose "who" or "whom" based on that word's role within the clause, not within the sentence. (Note that "whom" is the object form of "who," much as "him" is the object form of "he.")

"The prize will go to whoever donates the most money" is correct. "whoever donates the most money" is a noun clause that acts as the object of the preposition "to," but "whoever" is the subject of that clause, so you use that form. On the other hand "The prize will go to whomever I nominate" is correct because "whomever" is the object of the verb "nominate." It doesn't matter that the noun clause is an object.

Similarly, "Whomever I choose will get one million dollars" is correct. "Whomever I choose" as a clause acts as the subject of the sentence, but "Whomever" is the object of the verb "choose." "Whoever arrives here first will get one million dollars" is also correct because "Whoever" is the subject of the verb "arrives" within the clause that acts as the subject of the sentence.

There is a method to the madness.

Thursday, January 04, 2007

Piano-Hinged Dissections

Greg Frederickson owns the world of geometric dissections. Thankfully, he likes to map that world in exquisite detail. He just released his third book, covering piano-hinged dissections. I've known he's been working on it for a while, so I'm glad to see it's out, and I just added it to my "Books to Buy" list.

via Ed

Tuesday, January 02, 2007

Magic Hammers

Videoblogger Ze Frank is the master of extended metaphors. They're creative, they work, and they communicate. That's a golden combo in the world of colorful writing. His post today eloquently portrays the particular drunkenness brought about by Champagne.

Menu For Hope Raffle Program

As some of you know, I donated development work to the latest Menu For Hope raffle. I wrote the program that chooses winners for each raffle prize. Choosing a random name from a list does not pose a difficult challenge, but interpreting the higgledy-piggledy comments does. Ticket purchasers left a comment specifying the prizes they wanted to buy tickets for, and you can imagine that the human-entered text was all over the map. I looked at the list as the raffle progressed, and wrote a program that would attack based on what I saw as the most common types of comments.

I knew I'd have to clean up some of the data (I tested the program by proofreading its entire output to compare my interpretation of each line to my program's), but I was happy to see that my code correctly parsed about 90 per cent of the comments. If I were writing this again (or modifying it next year), I might add some code to handle the case where a donor types "UW 01" instead of "UW01." That extra space accounted for roughly half of the cleanup work I needed to do. I deliberately kept my code "stupid" since clever code often takes longer to debug, and I still would have had to proofread all the output.
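Here is a minimal sketch of what that extra handling might look like. It is not part of the raffle program: the class name `PrizeCodeNormalizer` and its `normalize` method are made up for illustration, and it assumes that a prize code is always two letters followed by two digits. It uses a regular expression to close up a gap between the letters and the digits before any other parsing happens.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical helper; not one of the raffle program's classes. */
class PrizeCodeNormalizer {

    // Two letters, some whitespace, then exactly two digits. The lookbehind
    // keeps us from grabbing the tail of a longer word or number, and the
    // lookahead keeps us from grabbing the front of a longer number.
    private static final Pattern SPACED_CODE =
        Pattern.compile("(?<![A-Z0-9])([A-Z]{2})\\s+(\\d{2})(?!\\d)");

    /** Upper-cases the comment and rewrites "UW 01" as "UW01". */
    static String normalize(String comment) {
        String upper = comment.toUpperCase().trim();
        Matcher m = SPACED_CODE.matcher(upper);
        return m.replaceAll("$1$2");
    }

    public static void main(String[] args) {
        System.out.println(normalize("uw 01, UW10 and ue 03"));
    }
}
```

One caveat: any two-letter word followed by a two-digit number (say, "on 10") would also be glued together, so the output would still need proofreading.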

Here are some of the lines the program correctly parsed:

UW01, UW10, UW36, UE03, WB04
$40 on red. er, i mean, UW39 :)
Two UW17, one UW48 please
1 tkt each AP09, AP10, AP25, AP31, AP36, AP40, 4 tickets AP02

But it couldn't interpret:

2 each on UE13 UE03 UE04 UC08 UC07
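A future version might recognize the "N each" pattern explicitly. The sketch below is only an illustration, not code from the raffle program: `EachQuantityParser` and `expand` are hypothetical names, and it assumes the quantity sits directly before the word "each" and applies to every prize code on the line.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical helper; not one of the raffle program's classes. */
class EachQuantityParser {

    private static final Pattern EACH = Pattern.compile("\\b(\\d+)\\s+EACH\\b");
    private static final Pattern CODE = Pattern.compile("\\b[A-Z]{2}\\d{2}\\b");

    /**
     * Returns one list entry per ticket when the comment contains an
     * "N each" phrase, or an empty list when it does not.
     */
    static List<String> expand(String comment) {
        String upper = comment.toUpperCase();
        List<String> tickets = new ArrayList<String>();

        Matcher each = EACH.matcher(upper);
        if (!each.find()) {
            return tickets; // no "N each" phrase; let other parsing handle it
        }
        int perPrize = Integer.parseInt(each.group(1));

        // every prize code on the line gets the same number of tickets
        Matcher code = CODE.matcher(upper);
        while (code.find()) {
            for (int i = 0; i < perPrize; i++) {
                tickets.add(code.group());
            }
        }
        return tickets;
    }

    public static void main(String[] args) {
        System.out.println(expand("2 each on UE13 UE03 UE04 UC08 UC07"));
    }
}
```

A line such as "1 tkt each AP09, AP10" does not match, because the number is not directly before "each," so it would fall through to the existing parsing.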

For transparency's sake, here's the Java code of the most interesting classes. We haven't done the official run yet, but I've cleaned all the data, and the program does what I expect. I've had the vicarious thrill (Melissa and I exempted ourselves from the raffle) of seeing various sets of "winners" in each test run. I only know what a few of the prize codes actually mean, but I've seen some good friends get fantastic prizes in some of the runs. I'll either send the program to Pim to run, or we will just pick a moment after which the output will become the official results (I found some lines in the data that I could not fix, and she is asking the donors for clarification).

Main program

import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.Iterator;
import java.util.List;
import java.util.Map;

/** Program for "drawing" raffle prizes for Menu for Hope. Not fancy, mostly
 * just procedural.
 * Attack strategy:
 * Pull in lines from CSV.
 * Parse each line to determine bidder, bid-on prizes, tix for each.
 * Divide into pools.
 * Draw randomly from each pool for winner.
 */

public class MFHRaffle {

    public static void main(String[] args) {
        // divide args into buckets
        // there's only one input arg (file) so we can size list in advance
        // (guard against a negative capacity when no args are given)
        List<String> commandArgs =
            new ArrayList<String>(Math.max(args.length - 1, 0));
        String filename = null;
        for (int i = 0; i < args.length; i++) {
            if (args[i].startsWith("-")) {
                // store the flag without its leading dash
                commandArgs.add(args[i].substring(1));
            } else {
                filename = args[i];
                System.out.println("Using file: " + filename);
            }
        }

        if (commandArgs.contains("testrandomdraw")) {
            testRandomDraw();
            return;
        }

        boolean debug = false;
        if (commandArgs.contains("debug")) {
            debug = true;
        }

        MFHDataParser parser = null;
        if (commandArgs.contains("csv")) {
            parser = new CSVDataParser();
        } else {
            parser = new ExcelDataParser();
        }
        parser.setDebug(debug);

        Map<String,List<String>> entries = parser.extractData(filename);
        if (entries == null) {
            // the parser returns null when it could not read the file
            System.err.println("No entries read from " + filename);
            return;
        }
        Map<String,Integer> prizeCounts = new HashMap<String,Integer>();
        Map<String,String> prizeToWinner = new HashMap<String,String>();

        //produce a sorted list
        List<String> sortedPrizes = new ArrayList<String>(entries.keySet());
        Collections.sort(sortedPrizes);

        // for every entry in map, throw list to randomDraw
        for (Iterator<String> prizeIt = sortedPrizes.iterator();
                prizeIt.hasNext();) {
            // drumroll please...
            String prize = prizeIt.next();
            List<String> bidders = entries.get(prize);
            // one list entry per ticket, so size * 10 is the amount raised
            prizeCounts.put(prize, new Integer(bidders.size() * 10));

            String winnerEmail = randomDraw(bidders);
            prizeToWinner.put(prize, winnerEmail);
        }

        // tab-delimited output for fatemeh
        System.out.println("********** TEXT ****************");
        for (Iterator<String> prizeIt = sortedPrizes.iterator();
                prizeIt.hasNext();) {
            String prize = prizeIt.next();
            System.out.println(prize + "\t" + prizeCounts.get(prize) + "\t" +
                parser.getNameForEmail(prizeToWinner.get(prize)) + "\t" +
                prizeToWinner.get(prize));
        }

        // html markup for brett
        System.out.println("********** HTML *****************");
        System.out.println("<table rules=\"rows\" >");
        for (Iterator<String> prizeIt = sortedPrizes.iterator();
                prizeIt.hasNext();) {
            String prize = prizeIt.next();
            System.out.println("<tr><td>" + prize + "</td><td>" +
                prizeCounts.get(prize) + "</td><td>" +
                parser.getNameForEmail(prizeToWinner.get(prize)) +
                "</td><td>" + prizeToWinner.get(prize) +
                "</td></tr>");
        }
        System.out.println("</table>");
    }

    /** Simple method for checking that the random draw algorithm is
     * sufficiently random.
     * This test method creates a list, draws a random item from it 10000 times,
     * and records the result.
     */
    private static void testRandomDraw() {
        List<String> items = new ArrayList<String>();
        for (int i = 0; i < 100; i++) {
            items.add(String.valueOf(i));
        }

        Map<String,Integer> counts = new HashMap<String,Integer>();

        // 100-item list. retrieve a random item 10000 times, and note result
        for (int i = 0; i < 10000; i++) {
            String retrieved = randomDraw(items);
            if (!counts.containsKey(retrieved)) {
                counts.put(retrieved, new Integer(1));
            } else {
                Integer curCount = counts.get(retrieved);
                counts.put(retrieved, new Integer(curCount.intValue() + 1));
            }
        }

        // print counts
        for (Iterator<String> itemIt = items.iterator(); itemIt.hasNext();) {
            String next = itemIt.next();
            System.out.println("Item: " + next + " Count: " + counts.get(next));
        }
    }

    /** Makes a random draw from a List of strings */
    private static String randomDraw(List<String> items) {
        // the cast loses precision, but we're not expecting lists anywhere
        // near large enough for that to matter
        int i = (int)Math.floor(Math.random() * (double)items.size());
        return items.get(i);
    }
}
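The draw needs no explicit weighting because each pool holds one list entry per ticket, so a bidder with more tickets simply appears more often. The sketch below illustrates that idea on its own. It is not part of the raffle program: `TicketPoolDraw` and the email addresses are made up, and it swaps in `java.util.Random.nextInt` for the `Math.random()` arithmetic used above.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Random;

/** Hypothetical illustration; not one of the raffle program's classes. */
class TicketPoolDraw {

    /** Picks one entry uniformly at random from a non-empty pool. */
    static String draw(List<String> pool, Random rng) {
        if (pool.isEmpty()) {
            throw new IllegalArgumentException("No tickets in pool");
        }
        // nextInt(n) returns an int in [0, n), so no cast or floor is needed
        return pool.get(rng.nextInt(pool.size()));
    }

    public static void main(String[] args) {
        // made-up bidders: alice holds 3 tickets, bob holds 1
        List<String> pool = new ArrayList<String>();
        pool.add("alice@example.com");
        pool.add("alice@example.com");
        pool.add("alice@example.com");
        pool.add("bob@example.com");

        Random rng = new Random();
        int runs = 10000;
        int aliceWins = 0;
        for (int i = 0; i < runs; i++) {
            if (draw(pool, rng).equals("alice@example.com")) {
                aliceWins++;
            }
        }
        System.out.println("alice won " + aliceWins + " of " + runs + " draws");
    }
}
```

With three of the four tickets, alice should win roughly three quarters of the draws, though the exact count varies from run to run.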

Data parser base class

import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** A class for parsing data files for MFH. This is an abstract class since
 * it provides common functionality for subclasses. Each subclass can deal
 * with a given file type (excel or csv)
 */
public abstract class MFHDataParser {

    private boolean debug = false;

    private char[] delims = {',',' ','.',';'};

    private Map<String,String> emailToName = new HashMap<String,String>();

    protected void mapEmailToName(String email, String name) {
        this.emailToName.put(email, name);
    }

    public String getNameForEmail(String email) {
        return this.emailToName.get(email);
    }

    public abstract Map<String,List<String>> extractData(String filename);

    /** Do our darndest to figure out what prizes a donor has mentioned on
     * a given line. Note: Be sure to complain if we don't recognize a prize.
     */
    protected List<String> extractPrizes(String prizes, int amount)
            throws NoPrizeFoundException {

        String ucPrizes = prizes.toUpperCase().trim(); //for consistency's sake
        if (isDebug()) {
            System.out.println("Incoming prize string " + ucPrizes);
        }

        // basic strategy
        // find two numbers followed by a delim (, space, ., eol, ;)
        // back up and find two letters before it
        // then back up (not into an earlier code) and find a #
        // can't use java's regex abilities because i need to divide into larger chunks

        // put that many copies into the list
        // verify that size of list = donation /10. bark if not
        // List = one of each real raffle ticket (5xUW03 -> 5 entries in List)

        List<String> prizeList = new ArrayList<String>();
        List<Integer> prizeCounts = new ArrayList<Integer>();

        if (ucPrizes.length() < 4 ) {
            throw new NoPrizeFoundException(ucPrizes);
        } else if (ucPrizes.length() == 4) {
            // exact count. easy case, but verify that it's legit
            if (findPrizeCodeInTextBlock(ucPrizes) != 0) {
                throw new NoPrizeFoundException(ucPrizes);
            }
            String prizeCode = ucPrizes;
            for (int i = 1; i <= amount/10; i++) {
                prizeList.add(prizeCode);
            }
            return prizeList;
        } else {
            // in this case we need to walk through the list, divide it into
            // chunks, and find the prize code in each
            int chunkStart = 0;
            for (int i = 0; i < ucPrizes.length(); i++) {
                if (i == ucPrizes.length() - 1 ||
                        isDelim(ucPrizes.charAt(i))) {
                    String curChunk = null;
                    if (isDelim(ucPrizes.charAt(i))) {
                        curChunk = ucPrizes.substring(chunkStart,i);
                    }

                    if (i == ucPrizes.length() - 1) {
                        curChunk = ucPrizes.substring(chunkStart,i+1);
                    }
                    if (curChunk.length() < 4) {
                        // too short for a code. keep chunkStart where it is
                        // so the chunk grows to include the next word
                        continue;
                    }
                    int prizeCodeOffset = findPrizeCodeInTextBlock(curChunk);
                    if (prizeCodeOffset == -1) {
                        continue; // not in this text block
                    }

                    String prizeCode =
                        curChunk.substring(prizeCodeOffset,prizeCodeOffset + 4);
                    prizeList.add(prizeCode);
                    int prizeCount =
                        findPrizeQuantityInTextBlock(curChunk, prizeCodeOffset);
                    if (prizeCount == -1) {
                        prizeCounts.add(new Integer(1));
                    } else {
                        prizeCounts.add(new Integer(prizeCount));
                    }

                    chunkStart = i + 1;
                }
            }
        }

        if (prizeList.isEmpty()) {
            throw new NoPrizeFoundException(ucPrizes);
        }

        // expand prize list as needed
        if (prizeList.size() == amount /10) {
            return prizeList; // if there are as many prizes as the amount
                              // would suggest, do one ticket each
        } else if (prizeList.size() == 1) {
            // create an expanded list that has one entry for each ticket
            List<String> newPrizeList = new ArrayList<String>(amount/10);
            for (int i = 0; i < (amount /10); i++) {
                newPrizeList.add(prizeList.get(0));
            }
            return newPrizeList;
        } else {
            // we have a mix of amounts and quantities
            List<String> newPrizeList = new ArrayList<String>(amount/10);
            for (int i = 0; i < prizeList.size(); i++) {
                Integer count = prizeCounts.get(i);
                for (int j = 0; j < count.intValue(); j++) {
                    newPrizeList.add(prizeList.get(i));
                }
            }
            return newPrizeList;
        }
    }

    protected int parseAmount(String amtString) {
        return (int)(Double.parseDouble(amtString));
    }

    /** Takes a guess at the prize quantity in a given text block */
    private int findPrizeQuantityInTextBlock(String chunk, int prizeOffset) {
        // walk over the string looking for numbers, skipping the prize code
        for (int i = 0; i < chunk.length(); i++) {
            if (i >= prizeOffset && i < prizeOffset + 4) {
                continue; // inside the prize code itself
            }

            if (Character.isDigit(chunk.charAt(i))) {
                if (i < chunk.length() - 1 &&
                        Character.isDigit(chunk.charAt(i+1))) {
                    // two-digit quantity
                    char[] digits = {chunk.charAt(i),chunk.charAt(i+1)};
                    return Integer.parseInt(new String(digits));
                } else {
                    return Integer.parseInt(chunk.substring(i,i+1));
                }
            }
        }

        // one last check. some bidders wrote "TWO" instead of 2
        if (chunk.indexOf("TWO") != -1) {
            return 2;
        }

        return -1;
    }

    /** Returns the offset of something that looks like a prize code,
     * or -1 if nothing in the chunk does. */
    private int findPrizeCodeInTextBlock(String chunk)
            throws NoPrizeFoundException {

        // look for 2 letters followed by 2 numbers => prize code
        for (int i = 3; i < chunk.length(); i++) {
            if (Character.isDigit(chunk.charAt(i)) &&
                    Character.isDigit(chunk.charAt(i - 1)) &&
                    Character.isLetter(chunk.charAt(i - 2)) &&
                    Character.isLetter(chunk.charAt(i - 3)) ) {
                return i - 3;
            }
        }
        return -1;
    }

    protected boolean isDebug() {
        return this.debug;
    }

    public void setDebug(boolean debug) {
        this.debug = debug;
    }

    private boolean isDelim(char c) {
        for (int i = 0; i < this.delims.length; i++) {
            if (c == delims[i]) {
                return true;
            }
        }
        return false;
    }
}


Excel parser


import java.io.File;
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.Iterator;
import java.util.List;
import java.util.Map;

import jxl.*;

/** Subclass of MFHDataParser that knows how to parse excel docs. */

public class ExcelDataParser extends MFHDataParser {

    public Map<String,List<String>> extractData(String filename) {
        Map<String,List<String>> retVal = new HashMap<String,List<String>>();
        try {
            Workbook wkbk = Workbook.getWorkbook(new File(filename));
            Sheet sheet = wkbk.getSheet(0);
            for (int i = 0; i < sheet.getRows(); i++) {
                if (i == 0) { continue; } // skip headers
                String name = sheet.getCell(0,i).getContents();
                String email = sheet.getCell(1,i).getContents();
                String date = sheet.getCell(2,i).getContents();
                if (isDebug()) {
                    System.out.println("amount: " + sheet.getCell(3,i).getContents());
                }
                int amt = parseAmount(sheet.getCell(3,i).getContents());
                String comment = sheet.getCell(4,i).getContents();
                if (email == null || email.trim().equals("")) {
                    throw new IllegalArgumentException("No email found");
                }
                mapEmailToName(email, name);

                if (comment == null || comment.trim().length() == 0) {
                    System.out.println("No comment on line " + (i+1));
                    continue; // nothing to parse on this row
                }
                List<String> prizes = extractPrizes(comment,amt);
                if (isDebug()) {
                    System.out.print( "prizes for line " + (i+1) + " ");
                    for (Iterator<String> prizeIt = prizes.iterator();
                            prizeIt.hasNext();) {
                        System.out.print(prizeIt.next() + " " );
                    }
                    System.out.println();
                }

                if (prizes.size() != amt / 10) {
                    System.out.println("Line " + (i+1) +
                        " does not have the right number of prizes for the amt");
                }

                // compress the lists down to MFHPair, which includes an email
                // and a count. Insert into map, keyed by prize code
                Collections.sort(prizes); // make sure they're in order
                String curPrize = "";
                int curCount = 0;
                for (Iterator<String> prizeIt = prizes.iterator();
                        prizeIt.hasNext();) {
                    String prizeFromList = prizeIt.next();
                    List<String> bidders = null;
                    if (retVal.containsKey(prizeFromList)) {
                        bidders = retVal.get(prizeFromList);
                    } else {
                        bidders = new ArrayList<String>();
                        retVal.put(prizeFromList, bidders);
                    }
                    bidders.add(email); // one entry per ticket
                }
            }
            return retVal;
        } catch (Exception e) {
            System.err.println("There was a problem: " + e);
            return null;
        }
    }
}