Upcoming appearances (San Diego & Seattle!)

This weekend I’ll be in San Diego for ConDor, a friendly neighborhood sci-fi convention! I will be speaking, opining, and making enemies discussing the following:

SATURDAY, 4PM: “Synergy between Writer and Illustrator” — Kevin Gerard, Jennifer K. Fong, Laura Brodian Freas, David Malki !, Sue Dawe.

SATURDAY, 5PM: “Extemporaneous Story Telling: Grand Unified Conspiracy Theory 2” — Our panelists take elements from the audience and tell the conspiracies that interconnect them. Gradually they interweave these elements into a theory that explains EVERYTHING. David Malki !, Sherwood Smith, Chris Weber, David Ross, Mike Bocianowski.

SUNDAY, 10AM “The Making of Wondermark” — Comics created live on stage, in front of your very eyes!

SUNDAY, 11AM Autograph session in the dealer’s room.

SUNDAY, 12PM “Ebooks and the Future of Publishing” — Richard Dean Starr, John Oliver, David Malki !, Val Ontell, J. M. Perkins. (I’m moderating this one.)

SUNDAY, 1PM “What Makes a Story Steampunk?” — David Malki !, Scott Farrell, Denise Dumars, Sharon Mock, David Lee Summers.

SUNDAY, EVENING “David please go to bed” — My brain

At the end of March I’ll also be in Seattle for the Emerald City Comicon, which is three days long this year and will surely, surely be an amazing show. Come say hello!

Also on my calendar for the spring are:

Calgary Comic & Entertainment Expo, April 27-29

Toronto Comic Arts Festival, May 5-6

Vancouver Comic Arts Festival, May 26-27 — My first-ever Vancouver appearance. (Oh, and the show is soliciting volunteers now!)

And maybe others too!

Wondermark Randomizer

Another development with the corpus: programmer Matt has created a Wondermark random script generator that uses fragments from existing comic transcripts to create all-new (randomized) versions. For example:

Panel 1

Hugo: Jen hard to stop a tiny dinosaur working a series of letters. Glen it was hard.

Panel 2

Hugo: It sounded kinda human… but that’s not doing it in extra-small! Hello?

Jeff: Washington, I can hear something.

Hugo: A chef, the bearded man in top hat and monocle. Stop eating the laces out of the salmons, you want a freakin’ recipe. Well…I hate to say that yesterday was “awkward”.

Panel 3

Jeff: Seven pounds, three ounces, twenty inches long. Garth oh, baby! Plus, for many of these theories are true friends. It’s better than winning an ice cream turns to soot, or my kinks.

Hugo: Home. They miss their parents, their sweethearts.

Jeff: You’ll chastise me awkwardly in the service, it’s how some people don’t talk to themselves!

Another:

Panel 1

Researcher: It is unfortunate…but, Wendy…it has happened.

Panel 2

Researcher: I cried a buttload. They’re molehair — we can all agree that I cried that day. Marc running the taffy machine again and not maketh more, thou shalt be put to use a computer.

Student: Sometimes I overstock my cart just to teach the baby Portuguese. Your mother, she is one stone cold super-fox. Don’t worry.

Researcher: Joe the country’s industries have shut down. Absolutely unbelievable!

Student: Stay up late with me. They will breathe with gills that make public radio possible.

Panel 3

Student: That sandwich will be bibliophibians. Consider this!

Panel 4

Student: Hey lady, how much are you enjoying that sandwich, boy? He replaces his monocle with a baby.

Panel 5

Student: Frikkin bright!!

It is pretty super-great, especially when you imagine all of them being read by Morgan Freeman. Make your own! Thanks, Matt!

Word Cloud, etc. Part 2

Here are some responses to the word cloud and corpus I released earlier…

John B. submits the above image, created from just a list of episode titles, using Tagxedo. (I used Wordle for mine.) He also points out that Tagxedo contains more customization filters and tools for creating word clouds, which is good to know. Thanks, John!

Rubrick correctly points out that “It should, I feel, be a crime to refer to ‘words that show up once and only once’ without using the awesome linguistic term for such words, hapax legomenon.

Apparently there are ~6100 such words in the corpus, and they have helpfully been extracted by Jonathan B. here. How boring a writer I am to have only used ‘breakdancing’ once in nine years!

Elytsvil accurately notes that the corpus I released is missing a lot of punctuation, which Oh No Robot uses as meta-markers. I did streamline the textfile somewhat (eliminating URLs, strings of punctuation, and the ubiquitous ‘In which’) to try and force the word cloud into something approaching relevance, but the point is duly noted. Here is a completely unredacted ONR data export.

Finally, Shmibs extracted a list of the longest words in the corpus, and some of them are pretty great:

nervousenergynervousenergynervousenergy (from the alt-text on #016)

abcdefghijklmnopqrstuvwxyz — from #598

mmmellllltiiiinnnnngggggg — from #247

procrastihibernation — title of #614

radiogrammephonimat — a transcriber-added detail to #401

biepinzingerunting — from #715

biepbiepbieperzung — also from #715

telegrameutophium — a transcriber-added detail to #336

relationshaaaooww — from #588

yardeyardeyaryar — from the alt-text (which I wrote) on #199 (a guest comic)

superdorkasaurus — from the alt-text on #662

dunderschnauzen — from #515

glondxhatzoljlg — from #680

What an erudite collection of completely invented words and/or sounds!

800 Episodes Word Cloud

On the occasion of Wondermark’s eight hundredth episode, I thought I would celebrate by looking at a complete corpus of words used in Wondermark, and creating a cloud from them (similar to my existing tag cloud of subject matter):

“Huh,” I thought to myself, “I suppose it is unsurprising that the most common words used in a large sample of comics probably closely resembles a list of common words found in the language in general.”

So no great discoveries here, unfortunately. It’s further complicated by the fact that the text I’m using as a corpus is an export of my Oh No Robot database, which contains user-submitted transcriptions of all my comics, which themselves often contain transcriber-invented character names and extensive scene descriptions — both of which are great, but which somewhat muddy the dataset. The heavy incidence of the words “man” and “woman” in the cloud, for example, are probably due to transcriptions reading something like:

Man: I have started a bean farm.
Woman: We’ll be millionaires!
Man: Not if flies eat the crops first.
Woman: Time to invest heavily in pesticides.

In that sample transcription, the words “man” and “woman” both appear twice as frequently as any other word, despite not occurring in the dialogue at all.

It’d be neat to see, instead of a brute word-frequency cloud, something like a collection of statistically improbable phrases, or words that show up in Wondermark once and only once…things like that. I wonder what interesting things could be mined from the data? If you’d like to play around with the corpus yourself, dirty as the set is, here’s the text file I used. If you derive anything neat, let us know!

Check out: Surgical book sculptures

These book sculptures by Brian Dettmer are amazing.

“Using knives, tweezers and surgical tools, Brian Dettmer carves one page at a time. Nothing inside the out-of-date encyclopedias, medical journals, illustration books, or dictionaries is relocated or implanted, only removed.”

A million more pictures at the link. Thanks to Marksman John H. for the tip!