<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>jebsblog &#187; captioning</title>
	<atom:link href="http://jebswebs.net/blog/tag/captioning/feed/" rel="self" type="application/rss+xml" />
	<link>http://jebswebs.net/blog</link>
	<description>comments about accessibility and web design</description>
	<lastBuildDate>Thu, 26 Aug 2010 14:58:29 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
		<item>
		<title>Captioning YouTube Videos</title>
		<link>http://jebswebs.net/blog/2010/05/captioning-youtube-videos/</link>
		<comments>http://jebswebs.net/blog/2010/05/captioning-youtube-videos/#comments</comments>
		<pubDate>Thu, 27 May 2010 18:39:45 +0000</pubDate>
		<dc:creator>jeb</dc:creator>
				<category><![CDATA[Accessibility]]></category>
		<category><![CDATA[General Information]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[captioning]]></category>
		<category><![CDATA[speech-to-text]]></category>
		<category><![CDATA[transcription]]></category>
		<category><![CDATA[YouTube]]></category>

		<guid isPermaLink="false">http://jebswebs.net/blog/?p=509</guid>
		<description><![CDATA[Back in March 2010, I rather gleefully blogged about YouTube&#8217;s latest feature called &#8220;automatic captioning.&#8221; Since that time, I have become bemused and amused by the state of this &#8220;service.&#8221; It seems Google &#8211; the owners and operators of YouTube &#8211; have been using our videos as fodder for their new Google Voice speech-to-text (S-t-T) [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://jebswebs.net/blog/wp-content/uploads/2009/12/youtube_logo.jpg"><img class="alignright size-full wp-image-276" title="youtube_logo" src="http://jebswebs.net/blog/wp-content/uploads/2009/12/youtube_logo.jpg" alt="You Tube logo" width="264" height="198" /></a>Back in March 2010, <a href="http://jebswebs.net/blog/2010/03/captioning-and-youtube/">I rather gleefully blogged about  YouTube&#8217;s latest feature called &#8220;automatic captioning.&#8221;</a> Since that  time, I have become bemused and amused by the state of this  &#8220;service.&#8221; It seems Google &#8211; the owners and operators of YouTube &#8211;  have been using our videos as fodder for their new <a href="http://www.google.com/voice">Google Voice</a> speech-to-text  (S-t-T) translation machine. Google claims, &#8220;It (Google Voice transcripts) will  improve over time as our transcription engine gets smarter.&#8221; It is not  clear how the Google transcription engine will get &#8220;smarter,&#8221; but  I&#8217;m, figuring the more the system is used, the more it will learn, and the  smarter it will become&#8230;make sense?</p>
<p>Whoever perfects S-t-T stands to make billions in the first  year, so it stands to reason Google would be interested in tapping into that treasure  chest. But perfecting S-t-T has always been an elusive goal and anyone worth  their salt in the captioning or transcription business knows the human beings  still make the best captionists.</p>
<p>That said, at the recent <a href="http://jebswebs.net/blog/2010/05/the-unconference/">Accessibility Unconference</a> a few  weeks ago, the issue of S-t-T came up and there was lots of interest in YouTube&#8217;s  &#8220;automatic captioning&#8221; service. I should note here that YouTube  currently calls this a &#8220;machine transcription&#8221; service and offered it  with some caveats. They also seem, in some ways, to be more interested in the  language translation tool that was also delivered on YouTube at the same time.  Perhaps there is more money to be made in the translation of Chinese to English  than in S-t-T.</p>
<p>At the Unconference, there was one gentleman who represented  a transcription service company in Massachusetts that used a system  based upon a combination of automated S-t-T and human power. He claimed that his  system was much faster than regular human-only transcription because machines  take the first cut at the translation and humans completed the final edits. He  also claimed it was flawless. Lastly, he noted that the fee for this service  ranged on a scale based upon the quality of the audio. Apparently, the poorer  the quality of the speech, the more interactions with humans is necessary, and  the more expensive is the price tag.</p>
<p>So all this got me thinking about <a href="http://jebswebs.net/blog/2010/03/captioning-and-youtube/">the experimental YouTube video  I created and posted back in early March</a>. The &#8220;automatic captioning,&#8221;  eh, machine translation, of my video was indeed a bit hilarious. Sharing it  with friends, we all howled at the bizarre transcripts that were produced by  the system. It was a bit like playing that <a href="http://en.wikipedia.org/wiki/Chinese_whispers">children&#8217;s game, &#8220;Telephone,&#8221;</a> where you whisper something into  someone&#8217;s ear and they whisper it into the next person and so on down the line  until the last person says it out loud. The final product never comes out  correctly and is usually quite funny. And indeed, the YouTube &#8220;machine  transcription&#8221; was much the same.</p>
<p>For my test video, I purposely read a printed text -  as  opposed to spontaneous speech &#8211; so I would have an exact copy of the content  from which to compare the transcript. The results were marginal at best and  honestly, the transcript really made no logical sense. It was also amazing what YouTube&#8217;s machine translation failed to recognize. The machine translation had a particular  difficult time with the words &#8220;accessibility&#8221; and &#8220;web  design.&#8221; Go figure.</p>
<p>I recently learned that you could download the YouTube  machine translation, edit it, and then re-post it to the original YouTube video.  So, today I finally got around to trying this and though successful, the  process was not without pain.</p>
<p>First, the machine transcript is saved in some unique  YouTubian format (.SBV). The content is readable using a simple text editor and  looks like this:</p>
<pre> 0:00:02.179,0:00:07.740
   okay so am I- of doing it tested video here
   it and I'm going to read this to see if the
   0:00:07.740,0:00:09.959
 captioning system works well</pre>
<p>Fortunately, my <a href="http://www.synchrimedia.com/">MovCaptioner software</a> could import the file  and provide an easy way for editing the content. But after editing the text, I  could not export the transcript without first merging it with a video. I had to  grab the original video from YouTube (which I downloaded in .MP4 format) and  then load that into MovCaptioner. Once the editing was finished (see note below  about time), I was able to save and export the file in another format (.SUB for  Subtitle format) and then upload that transcript file to YouTube.</p>
<p>The final edited .SUB file looks like this:</p>
<pre> 00:00:02.17,00:00:07.72
   Okay so I am doing a test
   video here and I'm going to
   read this to see if the
   00:00:07.74,00:00:09.94
 captioning system works well</pre>
<p>As predicted, the most strenuous part of the process is the  actual editing of the transcript. Even though the machine transcript had gotten  about 50% of the content correct, it still took close to 45 minutes for me to  edit the three minutes of video. It is clear that I talk pretty fast, as there  was 75 lines of text that had to be edited. I can&#8217;t imagine doing this for  anything longer.</p>
<p>So, I&#8217;ve learned a few things here:</p>
<p>First, YouTube&#8217;s &#8220;automatic captioning/machine  translation&#8221; is far from perfect and must not be used, at this point, for  anything other than amusement. I am not sure if Google has a timeline on when  this will get better, but until it produces accuracy at a 85% or higher basis,  I would not rely on it as a usable transcription.</p>
<p>Second, while machine translation, followed by human editing  is clearly more accurate than machine translation alone, the time savings may  not be all that one might imagine. I&#8217;m guessing that a professional  transcriptionist using state of the art equipment would have been able to  transcribe the three minutes of video a lot faster than I was able to edit the  machined version.</p>
<p>Last, we are still a long way from fully accurate S-t-T and if  you are going to use videos on your websites, and want them to be accessible,  you are probably still going to have to pay someone to create a  transcript/caption file for you.</p>
<p>Note: <a href="http://www.youtube.com/watch?v=6jiFrnFvUJs">jeremykemp has posted a YouTube video </a>comparing human vs. machine translation on several video clips. You can see the errors produced by the machine transcription.</p>
]]></content:encoded>
			<wfw:commentRss>http://jebswebs.net/blog/2010/05/captioning-youtube-videos/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Captioning and YouTube</title>
		<link>http://jebswebs.net/blog/2010/03/captioning-and-youtube/</link>
		<comments>http://jebswebs.net/blog/2010/03/captioning-and-youtube/#comments</comments>
		<pubDate>Wed, 10 Mar 2010 17:55:40 +0000</pubDate>
		<dc:creator>jeb</dc:creator>
				<category><![CDATA[Accessibility]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[captioning]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[MovCaptioner]]></category>
		<category><![CDATA[YouTube]]></category>

		<guid isPermaLink="false">http://jebswebs.net/blog/?p=275</guid>
		<description><![CDATA[UPDATE &#8211; March 10, 2010: Yes, it is true. Google has announced that the &#8220;automatic captioning service&#8221; first detailed in November, is now available to all accounts (channels). It appears that, for now, you have to &#8220;request&#8221; the service (although it appears they automatically had captioned my latest video which was posted several months ago), [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignright size-medium wp-image-276" title="youtube_logo" src="http://jebswebs.net/blog/wp-content/uploads/2009/12/youtube_logo-300x225.jpg" alt="youtube logo" width="151" height="113" /></p>
<p><strong><em>UPDATE &#8211; March 10, 2010: Yes, it is true. Google has announced that the &#8220;automatic captioning service&#8221; first detailed in November, is now available to all accounts (channels). It appears that, for now, you have to &#8220;request&#8221; the service (although it appears they automatically had captioned my latest video which was posted several months ago), and they will eventually get to all of them. Pretty cool. <a href="http://techcrunch.com/2010/03/04/youtube-launches-auto-captions-for-all-videos/">More on the announcement</a>. <a href="http://www.google.com/support/youtube/bin/answer.py?hl=en&amp;answer=100077">Directions on how to caption</a></em></strong></p>
<p>I recently heard the news about the new &#8220;automatic captioning&#8221; that Google is providing to certain <a href="http://www.youtube.com">YouTube</a> accounts. <a href="http://googleblog.blogspot.com/2009/11/automatic-captions-in-youtube.html">According to the &#8220;Official Google Blog:&#8221;</a></p>
<blockquote><p>&#8230;we&#8217;ve combined Google&#8217;s automatic speech recognition (ASR) technology with the YouTube caption system to offer automatic captions, or auto-caps for short. Auto-caps use the same voice recognition algorithms in <a href="http://googleblog.blogspot.com/2009/03/here-comes-google-voice.html">Google Voice</a> to automatically generate captions for video. The captions will not always be perfect (check out the video below for an amusing example), but even when they&#8217;re off, they can still be helpful—and the technology will continue to improve with time.</p></blockquote>
<p>Apparently, Google is rolling this out with a select group of partners and on specific channels. My understanding is that Google will simply start captioning videos in these groups using this new automatic system.</p>
<p>Anyone who knows anything about captioning knows that automatic systems are fraught with problems. It seems the best captioners are still human beings. And, well, I&#8217;m guessing Google is not interesting in hiring half the population of the planet and training them to become transcriptionists. Cause that&#8217;s what it would probably take to get enough human power to deal with the zillions of <a href="http://www.youtube.com">YouTube videos</a> out there.</p>
<p>But if you can&#8217;t wait for Google to automatically caption the home videos of your kids opening their Christmas presents, you can use another, lesser-known, and equally free service called <a href="http://captiontube.appspot.com/">CaptionTube</a>. It is not clear from my reading if <a href="http://captiontube.appspot.com/">CaptionTube</a> is a service that <a href="http://www.googlelabs.com/">Google Labs</a> developed themselves or whether is was acquired through some kind of company merger, but in any case, the price is right. I&#8217;m still playing with it so I don&#8217;t have an official opinion yet. If you are a master user, send me a comment or an e-mail.</p>
<p>I have, for a year or so, been also playing around with an application called <a href="http://www.synchrimedia.com/">MovCaptioner</a> that runs on the Mac OSX. <a href="http://www.synchrimedia.com/">SynchriMedia, the maker of MovCaptioner </a>has been promising a Windows version, but I&#8217;m thinking CaptionTube might be the right product at the right price. MovCaptioner costs $39.95 for one license which provides free updates. Multiuser licenses are also available for a discount.</p>
<p>Both <a href="http://www.synchrimedia.com/index.html">MovCaptioner</a> and <a href="http://captiontube.appspot.com/">CaptionTube</a> work essentially the same way. You load your video (in the case of CaptionTube, you can work off an existing YouTube video that has already been  published). As you play back your video in the application, you can stop (marking the time code automatically) and type in what the people on the video are saying. It is not really easy to do, so I have developed an new affinity for the people who do this work professionally. People do not talk in nice tight sound bytes, so you will quickly find it is hard to &#8220;stop the tape&#8221; at the appropriate spot and add the caption. You also have to have pretty good listening skills. You will end up often repeating the clip to get the wording correctly. Again, it&#8217;s not easy.</p>
<p>After you have created the text for your captions, you click some buttons, uploading the caption file, and check back in a little while and see your YouTube with captions. In the case of MovCaptioner, you have a number of options for saving and publishing your video. MovCaptioner has the advantage of saving a file that can use it with, or converted for use with any media player, not just the Flash media player that YouTube uses.</p>
<p>Both captioning systems appear to use an &#8220;closed caption&#8221; method meaning the caption transcript is kept separate from the video file (not embedded like subtitles in old movies). It can be turned off and on by the user, and the transcript itself can be saved and used separately &#8211; with or without the time codes. This is a nice option.</p>
<p>I&#8217;ve made this all sound very simple; it&#8217;s not. But, it is not all that difficult either. Like anything, it is an acquired skill.</p>
<p>I am hoping this new automatic service from Google takes off and become universally available soon. At the very least, Google could first provide this as a service for folks who need to get their videos captioned now (e.g., educational institutions, governments, etc.). Maybe even open it up with invites like they did with GMail and GoogleWave. I&#8217;d be happy to be a beta tester.</p>
<p>Anyway, a solution to finding a quick and inexpensive way of captioning short videos is coming closer to fruition. Exciting times. Stay tuned!</p>
]]></content:encoded>
			<wfw:commentRss>http://jebswebs.net/blog/2010/03/captioning-and-youtube/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>
