<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Jochem Prins &#187; Media</title>
	<atom:link href="http://www.jochemprins.com/category/media/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.jochemprins.com</link>
	<description>about search, Web2.0 and things I like</description>
	<lastBuildDate>Sun, 20 Sep 2009 11:28:33 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Searching through videos with speech-to-text</title>
		<link>http://www.jochemprins.com/2009/02/10/searching-through-videos-with-speech-to-text/</link>
		<comments>http://www.jochemprins.com/2009/02/10/searching-through-videos-with-speech-to-text/#comments</comments>
		<pubDate>Tue, 10 Feb 2009 18:22:24 +0000</pubDate>
		<dc:creator>Jochem</dc:creator>
				<category><![CDATA[Media]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[exalead]]></category>
		<category><![CDATA[speech to text]]></category>
		<category><![CDATA[video]]></category>

		<guid isPermaLink="false">http://www.jochemprins.com/?p=134</guid>
		<description><![CDATA[Speech recognition is one of those technologies that has been around for quite a while but has not yet found its way into large-scale use in industry. Yes, we&#8217;ve probably all talked to an airline&#8217;s reservation computer at some point in our lives, but I wouldn&#8217;t call that real speech recognition. You will [...]]]></description>
			<content:encoded><![CDATA[<p>Speech recognition is one of those technologies that has been around for quite a while but has not yet found its way into large-scale use in industry. Yes, we&#8217;ve probably all talked to an airline&#8217;s reservation computer at some point in our lives, but I wouldn&#8217;t call that real speech recognition. You have to choose between a couple of words, and if you say something different, the system redirects you to one of the choices anyway. This is what we call a small-vocabulary speech recognition application. It is useful (I guess), but not what I consider the best use of speech technology.</p>
<p>The main problem researchers face is that each person&#8217;s style of speech is very different. Especially when several people take part in the same conversation, speech recognition technology has to deal with all those different styles and vocabularies. I don&#8217;t expect it to take many more years before the technology can handle those complications, but there is some good news already!  <span id="more-134"></span></p>
<p>For some (actually quite a lot of) possible applications of speech recognition, you don&#8217;t need 100% accuracy. If a completely accurate transcription of the speech is not strictly necessary, current technology can already be very useful. For example, wouldn&#8217;t it be nice to:</p>
<p>- Automatically transcribe your phone calls, meetings, and personal memos into text for later reference<br />
- Search through the content of a podcast archive for interesting episodes<br />
- Search through the content of videos for interesting shows</p>
<p>To achieve the latter, Exalead has partnered with <a title="LIMSI" href="http://www.limsi.fr/" target="_blank">LIMSI</a> and developed <a title="Voxalead" href="http://voxalead.labs.exalead.com/SpeechToText" target="_blank">Voxalead</a>. With Voxalead, you can search through the content of videos and play a matching video directly in the embedded player. The player displays the transcription next to the video, and clicking on a word jumps the video to the exact moment that word is spoken.</p>
<div class="mceTemp mceIEcenter">
<dl class="wp-caption aligncenter" style="width: 460px;">
<dt class="wp-caption-dt"><img title="Voxalead" src="http://labs.exalead.com/images/labs/voxalead.png" alt="Voxalead" width="450" height="200" /></dt>
</dl>
</div>
<p>There will definitely be plenty of errors in the transcription, but&#8230; it doesn&#8217;t matter! The goal is to search through videos, and if some words are not recognized, that&#8217;s no problem at all. I think it&#8217;s an excellent example of how to put current speech-to-text technology to good use. What do you think?</p>
]]></content:encoded>
			<wfw:commentRss>http://www.jochemprins.com/2009/02/10/searching-through-videos-with-speech-to-text/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Ha Ha! Your medium is dying!</title>
		<link>http://www.jochemprins.com/2008/04/12/ha-ha-your-medium-is-dying/</link>
		<comments>http://www.jochemprins.com/2008/04/12/ha-ha-your-medium-is-dying/#comments</comments>
		<pubDate>Sat, 12 Apr 2008 11:16:30 +0000</pubDate>
		<dc:creator>Jochem</dc:creator>
				<category><![CDATA[Media]]></category>

		<guid isPermaLink="false">http://www.jochemprins.com/2008/04/12/ha-ha-your-medium-is-dying/</guid>
		<description><![CDATA[Print is dead&#8230;]]></description>
			<content:encoded><![CDATA[<p>Print is dead&#8230;</p>
<p><embed src="http://www.liveleak.com/e/527_1205782611" type="application/x-shockwave-flash" pluginspage="http://www.macromedia.com/go/getflashplayer" scale="showall" name="index" height="370" width="450"></embed></p>
]]></content:encoded>
			<wfw:commentRss>http://www.jochemprins.com/2008/04/12/ha-ha-your-medium-is-dying/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
