Passing the time waiting for Wikipedia by musing on SOPA and multimedia information retrieval
It's like the SOPA blackout blacked out my brain or something. I am tired and cranky and feel like I basically spent my day just waiting for Wikipedia to come back on line. I guess I was actually pretty productive, but I did spend a lot of time looking at Twitter. And in fact was looking when a tweet on darkening Flickr photos came through from Mor Naaman and led me to his darkened photo in the screenshot above.
Yes, of course I am a stop SOPA person. If its piracy today, it's free speech under attack tomorrow... my thoughts naturally flow along those lines of logic. If online piracy is indeed a problem, there's gotta be a more creative, effective solution that one concocted by Congress.
But then it occurred to me that maybe SOPA actually would have positive implications for the effectiveness of multimedia retrieval. I mean, hey, if SOPA is going to force website owners to police user-contributed material anyhow, then we should put those website owners to work enforcing complete and consistent metadata.
For every video, every image, SOPA wants to know who it belongs to. Well, we could also require a complete description of its visual content. Stop Online Messy Metadata! Let's propose a SOMMA.
Let's require each multimedia items to have a minimum number of tags, and have those tags to be drawn a set of concepts organized in a formal ontology. The US Federal Government could actually start publishing one -- updating it each year to cut down on those concepts known to not contribute constructively to retrieval. (We could start with LSCOM.)
Let's require people posting videos to make sure that their speech transcripts have been corrected are are flawless (...avoid using words not in the vocabulary of the speech recognizer) and that their songs are published with links to pages containing the lyrics (...avoid including references to obscure Russian novels).
And then to control that overwhelming multimedia flood, we can set limits. Individuals should only publish a small quota of items, so that these can all be kept in a central index and found by a topical look up -- Dewey Decimal Classification style.
Formulating an information need is easy: just choose one from a well organized standardized list, defined to cover everything you could possible want to see in an image.
Basically, if they put their minds to it the US Federal Government could really improve multimedia retrieval.
Well, OK. I'm a stop SOPA person because I like my job and I like my messy world and the messy metadata that goes with it.
In its subtle way, Flickr is telling us that its actually really difficult as an Internet user to be pro-SOPA. The caption on the blacked out photo above is:
A Flickr user has darkened this photo for up to 24 hours as a symbolic gesture to raise awareness about the potential impact of two pieces of pending U.S. anti-piracy legislation - SOPA (The Stop Online Piracy Act) and PIPA (The Protect IP Act).
While the inclusion of this blackout screen is merely a symbolic gesture -- not an actual allegation that this photo violates any copyright -- a user wanted you to take note of the harmful impact these bills could have on innovation online. For more information see http://blog.flickr.net
And then at the bottom there's a link labeled:
Show the photo anyway. I do not wish to participate. Do not darken my photo.
OK. Wait a minute, if I click "Do not darken my photo" is that then literally or symbolically? Flickr is subtly showing us how natural the extension is from our daily use habits to the decision to support the principle-based movement to stop SOPA and preserve the basic freedom of Internet flow.
Look at how this works: Symbolically, what the darkened photo is, is a big message screaming "Do not darken my photo!!" to the US Federal Government. So aren't I really by clicking this link demonstrating that I am against a specific darkened photo, which is nearly the same thing as being against darkened photos in general? And from there also against legislation that might blot out some of that rich multimedia primordial soup that we take in, digest and put out as conversations, comments, more multimedia, the fabric of our online lives?
Hey, SOPA, leave my Internet alone. I do not wish to participate. I want my brain lit back up.
I divide my time between Radboud University Nijmegen and Delft University of Technology in the Netherlands. My research focuses on multimedia retrieval techniques that exploit speech and language and focus on human interpretations of meaning. I am particularly interested in internet video, in networked communities, and crowdsourcing techniques. Lately, I've been noticing how difficult it is to imagine life without search.