0:00:06yeah
0:00:08i
0:00:09well
0:00:10describe the
0:00:12two thousand and nine
0:00:14on this
0:00:15language
0:00:16recognition evaluation
0:00:18lre O nine
0:00:20um
0:00:21this
0:00:23uh
0:00:24discordant in this evaluation now
0:00:27and your
0:00:28U S
0:00:28government sponsorship and
0:00:30this work was largely done with
0:00:32great greenberg like owing
0:00:34in them this yeah
0:00:35multimodal
0:00:36information group
0:00:42so the two thousand nine
0:00:44evaluation
0:00:46was the fit in the
0:00:48series of
0:00:50this coordinated lre is the first was in ninety six
0:00:54and everything yeah
0:00:56and we're evaluations in two thousand
0:00:59three and two thousand
0:01:00five
0:01:01two thousand seven
0:01:03two thousand nine
0:01:05uh oh
0:01:05one might suspect that the
0:01:07could be another evaluation
0:01:09twenty eleven
0:01:10um
0:01:12trying to to that in nine
0:01:15he changes
0:01:16uh we're in the
0:01:18nature of the data
0:01:19say more about that
0:01:21the treatment of dialogue
0:01:22dialect
0:01:23mutually intelligible languages
0:01:26and in the
0:01:27set of evaluation test condition
0:01:29we will get to those
0:01:30um the data the
0:01:33oh oreo nine or
0:01:35indicated there there were
0:01:36eighteen total
0:01:38participating sites
0:01:42um
0:01:43the prior
0:01:44nist evaluations
0:01:47used conversational telephone speech
0:01:50this involved
0:01:51paying subjects
0:01:52yeah
0:01:53they call it
0:01:54nature
0:01:55language recognition you just wanna make a single call
0:01:58in their native language
0:02:00ah
0:02:01in the U S preferably control channel conditions
0:02:04this
0:02:06paradigm is becoming expensive and impractical
0:02:09it's hard to pay people to make single call these days
0:02:12talk to me
0:02:14um
0:02:14um
0:02:15helpful
0:02:17um
0:02:18access
0:02:19is easy
0:02:21so lre O nine
0:02:22attempted to use primarily
0:02:24down data
0:02:26in this case
0:02:27down
0:02:28uh
0:02:29from what voice of america
0:02:31right yes
0:02:33data
0:02:34um
0:02:35this was this
0:02:36sampled by the
0:02:37with the
0:02:38uh data consortium actually
0:02:40found data from
0:02:42three different yours of
0:02:44um
0:02:46voice american to where you started about data
0:02:49the L D C S
0:02:51other conferences separately reported on this
0:02:53data collection
0:02:54the a feasibility study of using this data for lre
0:02:59was done before and then
0:03:01done in the by
0:03:02researchers
0:03:04uh
0:03:04here at the brno university of
0:03:07of of technology
0:03:08uh that was a key part
0:03:10a lot in the data for this evaluation
0:03:13um
0:03:13the selected
0:03:14segments that were
0:03:15actually
0:03:16used for testing also does it for
0:03:18development work
0:03:20uh segments
0:03:21uh
0:03:22determined by wanting to be involved narrow bands
0:03:24speech
0:03:25and we want to get as many different speakers as possible
0:03:28um
0:03:29evaluation also use
0:03:30cts data that had been collected previously but for various reasons
0:03:34and not been used in the uh
0:03:36in the prior evaluation
0:03:41so
0:03:43um
0:03:45here is our list of target languages for this evaluation and then using found data is that
0:03:51have
0:03:52um
0:03:54more
0:03:55target languages
0:03:56indeed we had that
0:03:59twenty three
0:04:00in this case
0:04:01ah
0:04:02in some cases we just list it is languages quite that would trees have been created is
0:04:07dialect
0:04:08um
0:04:09english american and in american english and indian english or
0:04:13also uh
0:04:14indian
0:04:16and working so this one says we just do these into it
0:04:20a single part languages will talk about the language here condition
0:04:25um
0:04:27any D
0:04:29we specified eight
0:04:32um
0:04:33language pairs as being a
0:04:34particular interest
0:04:36um
0:04:37they either languages that are
0:04:39similar patient
0:04:40english dialects uh
0:04:43indian
0:04:44or do may be viewed as a dialect station
0:04:47other languages are
0:04:49many cases mutually intelligible post processing croatian
0:04:53a real
0:04:54haitian and french are of interest
0:04:57uh
0:04:57include such pairs
0:04:59it's cantonese mandarin spanish
0:05:01portuguese
0:05:02so we specify these as
0:05:04these eight as being the
0:05:06of particular interest for those who
0:05:09wanted to investigate um
0:05:13uh
0:05:14so the
0:05:16evaluation
0:05:18ah consist of a long
0:05:20series of trials for each of the
0:05:23in addition
0:05:24and as in the past
0:05:25we
0:05:26charlie's
0:05:27test segment
0:05:29our approximately thirty or approximately ten or parts
0:05:31really
0:05:32a three seconds of speech
0:05:36uh i for each trial
0:05:39you have
0:05:40a target language hypothesis
0:05:43and
0:05:44and alternative
0:05:45i thought this
0:05:48and for each
0:05:49ah
0:05:50a trial we require
0:05:52i passed a decision
0:05:54and the score
0:05:56yeah we
0:05:57specify three
0:05:58different
0:05:59has conditions this year
0:06:01the close second edition this is the
0:06:04traditional condition it
0:06:06but part of all the evaluation is required condition
0:06:09and this reach
0:06:11language segment
0:06:14oh you have a one of the target languages
0:06:17as i thought
0:06:18each segment is running
0:06:19really
0:06:20target is a part
0:06:21the alternative hypothesis is
0:06:23it's a different target language
0:06:25one of the other twenty two
0:06:27the open second edition
0:06:29the alternative i thought this is
0:06:32not simply that
0:06:33one of those twenty two languages it could be that they could also be some other language an unknown how
0:06:38does that language
0:06:40and finally we introduce the C of the language here condition
0:06:44which is designed to look at
0:06:46ah
0:06:47i just distinguishing here so that the
0:06:50i have this in all cases a single
0:06:53line you know
0:06:55target languages english the alternative
0:06:57uh is that it
0:06:59it's french
0:07:00um
0:07:01ah so
0:07:02there are two and we twenty three target languages there are two hundred fifty three pairs and
0:07:07a part of language you want to look at this way and
0:07:10systems were invited to do
0:07:11all of them
0:07:13only a couple chose to do so
0:07:14or selected ones in particular the
0:07:17eight a
0:07:18mentioned above
0:07:23uh this gives you some
0:07:24indication of the
0:07:27um
0:07:27training and
0:07:28test segments
0:07:29that will provide in there
0:07:31there's a
0:07:33source so
0:07:34a green
0:07:35language it
0:07:37it indicating the number of segments and between segments of each duration
0:07:41uh
0:07:43um they were providing it'd be a weight training or be away
0:07:48ah yes
0:07:49where cts that that all the
0:07:51cts data from previous evaluations where
0:07:54we're also available
0:07:55um
0:07:56and the B Y training we provided
0:07:58you know we provided lots of data and not just limited to these selected segments but
0:08:03oh
0:08:04a corporate move around
0:08:06terabytes
0:08:07uh
0:08:08a drive the route
0:08:10but we're distributed people but
0:08:12rich language we haven't had about two hundred uh
0:08:15from the really data
0:08:16a segment of each
0:08:18duration separated
0:08:19but that S yeah
0:08:21we had open
0:08:22three or four hundred
0:08:23alright
0:08:25quite depending on availability
0:08:26and
0:08:27we we had
0:08:28training in languages which
0:08:32i i'm a
0:08:34and that we had with training data
0:08:36are all the languages for which i was not
0:08:38ah
0:08:40previous cts data in many languages that relevant data with cts but the new
0:08:45you data could be the other way
0:08:47um
0:08:48so
0:08:49that's
0:08:52numbers are there eighteen side
0:08:54they're listed here are many lamar
0:08:56represented in
0:08:57this room
0:09:00evaluation metric with the
0:09:02traditional metric yeah
0:09:04we have used a
0:09:05is essentially something like yeah
0:09:07total error rate
0:09:09we
0:09:10equally weight
0:09:11a lot of miss the cost of false alarm
0:09:15take an average of miss rate the false alarm rate but we
0:09:18average that over all possible
0:09:21oh
0:09:21uh target languages all possible alternative languages
0:09:25and they ended
0:09:26uh
0:09:27computed this way
0:09:28there's also waiting indicator for the open second edition
0:09:31of how we wait that
0:09:33the outer set alternative to the
0:09:35for the
0:09:36are actual target languages
0:09:40so it's turn
0:09:42results so
0:09:44terms of the official metric
0:09:46uh these are the results
0:09:47four systems are
0:09:49the average scores uh
0:09:51the close any open set
0:09:53in addition
0:09:54uh the scores are cumulative so that the three seconds or
0:09:59is the total of the green and the yellow and there
0:10:01red bar
0:10:04oh
0:10:05opens
0:10:05it's close to laugh opens another right we have
0:10:08labels
0:10:09oh some systems indicate yeah
0:10:12the same system close at an open set
0:10:14traditionally we have not identified
0:10:17systems with their scores are
0:10:20ah
0:10:21in public presentations but you can
0:10:23uh they're open close
0:10:25i was in languages and you know it
0:10:27you know
0:10:27it's really
0:10:28three seconds or ten seconds the three seconds that takes the
0:10:32big performance
0:10:32yeah
0:10:33it
0:10:33close in all three
0:10:35his clothes and language here is a
0:10:38oh but
0:10:39two sides
0:10:40be
0:10:41yeah yeah
0:10:43uh language we wouldn't see
0:10:45the relatively
0:10:47uh
0:10:48good performance as you might expect on
0:10:50and language pairs
0:10:55and we traditionally put these on
0:10:57yeah what
0:10:58uh
0:11:00there are that part with the
0:11:02close to have them alive we have the various uh
0:11:06that's another
0:11:07thirty second
0:11:08uh and of the right for once
0:11:10we give a flavour that
0:11:11different or in thirty seconds
0:11:13and second
0:11:14three seconds the
0:11:16linearity of the
0:11:17most of what
0:11:18uh
0:11:19suggest underlying
0:11:20normal distributions
0:11:22uh
0:11:24it was open set and you can
0:11:26see that
0:11:27problem you taking going
0:11:29what was that the
0:11:30open so that uh
0:11:34oh there we
0:11:35on the right but up of the
0:11:37close that an open set for each of the
0:11:39a three durations are
0:11:41uh give you
0:11:43a sense there
0:11:49findings an analysis
0:11:53um
0:11:54yeah
0:11:55and i will talk about the effect of
0:11:57averaging
0:11:58in that while the other terms
0:12:00pulling back at work
0:12:02moving away from the term
0:12:03cool
0:12:04we had a long discussion at the workshop is it right to
0:12:09average
0:12:09get
0:12:10across multiple we have the same data multi try out the multiple languages
0:12:15and we
0:12:16then resolve that
0:12:17what with all that but
0:12:18see that
0:12:20funny thing that happened in particular for the
0:12:22and here is is is
0:12:24two systems were right then
0:12:26ukrainian
0:12:29ah
0:12:30uh
0:12:31so the regions where they
0:12:33cranium language type uh
0:12:34that's in the
0:12:35lou the russian language that uh this
0:12:38and these
0:12:39yeah
0:12:40inherently a symmetry
0:12:43uh
0:12:44uh between these these cars 'cause
0:12:46this is the page that
0:12:47i think the only possibility does it
0:12:49russian or ukrainian
0:12:51and if you
0:12:52average those pulling together
0:12:55what happens or system on the combined curve and black
0:12:58all right through the middle
0:12:59that's what you
0:13:01expect random one
0:13:03system too
0:13:06the
0:13:06binder
0:13:09where uh
0:13:11uh i mean
0:13:12lester combined performance
0:13:14one is that um
0:13:18uh we show the
0:13:20distributions
0:13:21uh on the road
0:13:23um the rhino records for the two languages and then
0:13:26different shapes
0:13:27and uh another thing to note
0:13:29is the
0:13:30choruses
0:13:31show
0:13:33the actual decision points the circles we
0:13:36a minimum
0:13:38the average
0:13:38point
0:13:40and
0:13:41the first
0:13:42system on
0:13:43they're right on top of one another in the middle of
0:13:46but
0:13:47calibration
0:13:50two
0:13:51the
0:13:52right there
0:13:53way
0:13:54at the extremes
0:13:56in the case and uh with the
0:13:57sort of the middle indicating it indicating for calibration
0:14:01combine them
0:14:02but
0:14:03hello
0:14:04to it
0:14:04i is what you see
0:14:07um
0:14:08so as i said their questions
0:14:10is it the right thing to
0:14:12average
0:14:13across languages
0:14:14um
0:14:15we have done so
0:14:18if you look at language pairs
0:14:22uh this is for one system
0:14:24one of the system that all the language pairs
0:14:26are we look
0:14:27at
0:14:28george dunning created
0:14:29the curve i believe
0:14:31ah
0:14:31this looked at
0:14:32although there isn't shows the ones that have the
0:14:34why
0:14:35that
0:14:36um
0:14:37average error rate
0:14:39um
0:14:40so all the others were
0:14:42low two percent
0:14:43ah
0:14:45most confusable up of the top word
0:14:48in the or do
0:14:49and by then
0:14:51croatian
0:14:52um
0:14:53these were among the black
0:14:55pairs of interest in
0:14:56uh you know these are certainly mutually intelligible they may be considered dialect
0:15:01and indeed
0:15:03oh yeah
0:15:04at least arguable that
0:15:06he
0:15:06these
0:15:07language or dialect distinctions are based
0:15:09but also and political
0:15:11boundaries are
0:15:13are rather than um
0:15:17then uh more inherent language patterns
0:15:19any case those two of the most confusable
0:15:22next one for russian ukrainian
0:15:24the
0:15:24english
0:15:26dialect
0:15:26and
0:15:27a dari farsi which are
0:15:29generally considered
0:15:31usually
0:15:32palatable given to you
0:15:33you are there is a god in there
0:15:35real french and
0:15:36is
0:15:37is uh
0:15:39in the list
0:15:39um
0:15:42uh when we
0:15:42several of them
0:15:44no
0:15:46a little
0:15:47list of leading
0:15:48one
0:15:49two that were in our that's the
0:15:51pairs of interest
0:15:52yeah nice and mandarin
0:15:54portuguese and spanish
0:15:55maybe certain
0:15:57different ways
0:15:58languages that might be regarded a similar effect
0:16:00um
0:16:02maybe aren't in at least
0:16:03for the
0:16:05a system involve or not
0:16:07all that hard
0:16:08distinguish
0:16:12all that we can look at
0:16:13uh
0:16:14the terms were in the right
0:16:16to a particular target languages towards the
0:16:20if you of everything price
0:16:21languages here we do so looking at the training corpus
0:16:25type
0:16:25the
0:16:26they show the various
0:16:27languages for the
0:16:29that he had a training on the
0:16:32be away data
0:16:33and then we look at the ones that training on
0:16:35cts data
0:16:37um
0:16:37you see kind of a movement
0:16:39how would be either way
0:16:41ah yes
0:16:42performance
0:16:44was on
0:16:45one two Q is that we're languages
0:16:48um
0:16:49but uh
0:16:50done previously among many cases the training the cts and the
0:16:53yeah but
0:16:54realigned unless spanish korean
0:16:57mandarin
0:16:58for example were among the best performing languages
0:17:00worst performing or several
0:17:02indian languages i mean other confusions there in the
0:17:05or do indian english
0:17:12oh yeah we look at performance by
0:17:15but the
0:17:16what was it
0:17:17test corpus whether it be away or cts
0:17:20thirty hand and three
0:17:22um
0:17:25and
0:17:25one thing we were
0:17:26sorry please with
0:17:27you know we just introduced
0:17:29using the only data
0:17:30you know with the
0:17:32we we recognise well in fact
0:17:34the overall performance was probably comparable
0:17:38um
0:17:39this even though for some of the V O A languages that are
0:17:42training with cts
0:17:43four
0:17:44some reason i don't know we know why
0:17:46the uh
0:17:47cts
0:17:48curves here appear less linear
0:17:53and some history
0:17:56so we like to
0:17:58but back
0:17:59over the course of several evaluation
0:18:01how things change
0:18:03are we seeing better performance there have yet
0:18:05that that
0:18:06ah
0:18:08okay we have occurs over there
0:18:09evaluation use of the numbers of target languages
0:18:12go on
0:18:14up in recent evaluation
0:18:16number of participants will open up in them too much recent evaluations but we're
0:18:20yeah slightly into the nineteen thirty seven seven wonderful
0:18:23hereby try to
0:18:25uh
0:18:26you're simply blah
0:18:27and with an increasing number
0:18:30of um
0:18:31out of seven languages
0:18:37as for the basic
0:18:38one of the major
0:18:40um
0:18:43for thirty seconds
0:18:44with that
0:18:46nice
0:18:46uh
0:18:48right and uh
0:18:50you know
0:18:50garcia good
0:18:51data exchange languages it
0:18:53type change that but
0:18:54are we think uh improved results for
0:18:57three second
0:18:58four
0:18:59every second for the past
0:19:01couple evaluations are
0:19:03we seem to
0:19:04yeah
0:19:04have but a
0:19:06i'm terms of the
0:19:07the system
0:19:08also noted this year's three second performance was at the level
0:19:12thirty second performance
0:19:14in nineteen ninety six
0:19:19oh here we
0:19:19do some history looking at the best system
0:19:22you know caviar differences reflect
0:19:25well
0:19:25systems
0:19:26and
0:19:27someone changes in the task definition and of course
0:19:29different data in it
0:19:31hard to sort those out of it
0:19:33a different vol
0:19:34no less
0:19:35what can we say about how well
0:19:38romances
0:19:39there
0:19:40ah
0:19:42um
0:19:43i think we hinted that before but
0:19:45three seconds um
0:19:48we see a
0:19:49it was lacking
0:19:50oh nine wounded or seven media
0:19:52anything ewing performance improvement but
0:19:55in the
0:19:56there can second bite out in the
0:19:59thirty second maybe we
0:20:01right progress
0:20:02a bit
0:20:08oh really
0:20:09look at a couple of individual languages
0:20:11uh
0:20:12that's for sure
0:20:13tend to do the same language uh
0:20:15oh nine
0:20:16O seven in the
0:20:17of the 'cause O nine minutes of seven and the colours are one of the three durations
0:20:22and here
0:20:24to kind of language in which they were we have language
0:20:26pair since
0:20:27for korean
0:20:29oh
0:20:31we haven't seen improvements
0:20:32throughout
0:20:33right
0:20:34but
0:20:35the recycling three in two thousand nine is
0:20:38uh
0:20:38perfect the results are are
0:20:41ah
0:20:42languages
0:20:43part is one
0:20:44we see the overall having the
0:20:46we sing for the evaluation the whole
0:20:48ah
0:20:49improvement at three seconds uh
0:20:51a little change or even ridge regression
0:20:54thirty five
0:20:55and of course there are going to do that or
0:20:57new this year
0:20:59as well
0:21:04oh
0:21:04also here
0:21:06but dialect kind of has to be done previously to that
0:21:10american english and
0:21:11indian english uh
0:21:14uh
0:21:15that and we
0:21:16do see improvement like two thousand nine
0:21:19which is that the minutes
0:21:21thirty seconds
0:21:24and second
0:21:27and even more
0:21:28uh
0:21:34uh
0:21:35predicament
0:21:37a big there's three seconds
0:21:38american indian english
0:21:42and
0:21:42going to
0:21:44in the or do
0:21:46do you
0:21:47known to be a challenging language here
0:21:49but we see improvement thirty seconds
0:21:52three seconds
0:21:55yeah
0:21:55there's ten seconds
0:21:59oh
0:21:59and wait
0:22:00a three seconds
0:22:01ah
0:22:02three seconds
0:22:03well maybe this improvement
0:22:04yeah
0:22:05but have it
0:22:06three seconds in the order was that
0:22:09or too hard
0:22:10comparison
0:22:11performance little better than
0:22:13and random
0:22:16your words in summary
0:22:19are we experiment with a new
0:22:21data collection paradigm
0:22:23and we're reasonably satisfied with that producing a
0:22:26and effective evaluation get berkeley
0:22:29have trouble performance
0:22:31repeating this trick when the right data for future evaluations that remains a challenge
0:22:36uh we shall continue performance improvement
0:22:39uh of having a son
0:22:41a real nice based on the
0:22:43shorter segments
0:22:46um
0:22:48for both coding open say condition
0:22:51a language
0:22:52pairs was introduced
0:22:54here in particular for marketers it
0:22:56relative interesting poses challenges more likely
0:22:59you part of any
0:23:00in in
0:23:00if your evaluation that we do
0:23:03um
0:23:03this story
0:23:05an issue we've argued about about
0:23:07whether used actors average cross language
0:23:10and i think that yeah
0:23:11uh includes might
0:23:13right off
0:23:14thank you
0:23:21and
0:23:22information
0:23:30this is
0:23:30just
0:23:31a common
0:23:32on the
0:23:33comparing
0:23:34uh
0:23:34she happens
0:23:36tween
0:23:37uh
0:23:37that's done
0:23:38nist evaluations yes with the
0:23:40number
0:23:41target languages
0:23:42uh
0:23:43that's
0:23:45uh
0:23:46it
0:23:47uh
0:23:48they're more languages than the weight vanished that's it
0:23:52the hypothesis
0:23:55mostly
0:23:57so
0:23:58there are more languages
0:24:01uh
0:24:01you know list
0:24:02about five months
0:24:04'cause you know
0:24:05you less sure about which one
0:24:07to be
0:24:08so
0:24:09um
0:24:10that makes it a little bit on that
0:24:13it makes it a little bit harder
0:24:14just one
0:24:16makes it a little bit
0:24:17not not a lot
0:24:19if you were doing just fine
0:24:21identification
0:24:22obviously
0:24:23the number of languages as a strong stick
0:24:26second autistic
0:24:28which
0:24:29which don't have
0:24:31so
0:24:32arguably if we just
0:24:33apparently tread water but it made the problem are doing
0:24:36introduce language people haven't seen before
0:24:38i'd argue that
0:24:39that
0:24:40it'd be apart
0:24:41it's also predicate argument for the
0:24:43language pairs condition which
0:24:45well
0:24:47so
0:24:47tenderly
0:24:48i think that affect yeah
0:24:54we should
0:24:57you plan to
0:24:58to use it
0:24:59voice of america
0:25:01uh it uh for the nist evaluation
0:25:04oh
0:25:06there
0:25:07other than your right
0:25:08he
0:25:08it we need to discuss this with i don't
0:25:11think
0:25:12we can hope thing
0:25:13just get more voice of america data we're
0:25:16exploring
0:25:17um
0:25:19other
0:25:20similar type
0:25:21or or that may be available that have multiple languages
0:25:25um are there any
0:25:26recommendation that people with them
0:25:34yep
0:26:02uh i'm honestly wondering why
0:26:04four
0:26:05uh
0:26:06identification
0:26:08oh
0:26:08just
0:26:09sure
0:26:10so make or break
0:26:11two
0:26:12cation
0:26:13four
0:26:14i mean
0:26:14to do that
0:26:15uh
0:26:16you should
0:26:16and i'm i'm
0:26:18and you are using uh
0:26:19uh that it
0:26:20editions
0:26:21oh
0:26:23um
0:26:24i would like to do i need to find a direct
0:26:27just
0:26:27hmmm
0:26:28you know
0:26:29to
0:26:30yeah
0:26:31see
0:26:31oh
0:26:32and identification
0:26:34and i wonder why
0:26:35um
0:26:36you you could try your interesting identification with
0:26:40recognition
0:26:42because
0:26:42it's a
0:26:44whatever
0:26:44this
0:26:46if you use
0:26:47you
0:26:48correlation
0:26:48we thank you
0:26:50right but
0:26:51there's no
0:26:52no
0:26:53you you can
0:26:54some
0:26:55oh
0:26:56accuracy
0:26:57yeah
0:26:57yeah
0:26:58i i always wonder why
0:27:00i wanna see how well it does
0:27:02yeah
0:27:04yeah
0:27:05you use you understand
0:27:06well
0:27:08and i am not
0:27:09sure you're saying you're interested in
0:27:11bring in distinguishing particular
0:27:13mostly related to i
0:27:15or are you saying i think of the identification problem
0:27:19yeah the language of their and possibilities which one is it
0:27:23it yeah i
0:27:24yeah
0:27:25you target for that
0:27:26dialect
0:27:27i think
0:27:27yeah i'm interested in education
0:27:30like
0:27:31see
0:27:31a comparison
0:27:33uh
0:27:34this
0:27:35if you use your
0:27:37oh
0:27:38but i mean the language here
0:27:41condition does that computing
0:27:44yeah
0:27:45but
0:27:45but
0:27:46yeah
0:27:47what
0:27:48you have
0:27:49oh
0:27:50oh
0:27:52right
0:27:52yeah
0:27:53okay
0:27:54oh
0:27:55uh
0:27:56no
0:27:58one
0:27:58like
0:27:59yes
0:27:59the
0:28:00i comparison
0:28:03oh
0:28:04not
0:28:06yeah
0:28:06and
0:28:07and
0:28:08i
0:28:09okay
0:28:10yeah
0:28:10i think
0:28:11yeah
0:28:11huh
0:28:12and
0:28:13right
0:28:14yeah
0:28:15no
0:28:16yeah
0:28:19as opposed to
0:28:23i'm not okay nectar
0:28:24a couple of that but maybe that's something we can talk about for the wrong one
0:28:35yes
0:28:35right
0:28:35i can like
0:28:36combine
0:28:37um
0:28:38uh
0:28:39i've
0:28:42it's one thing
0:28:43so
0:28:44yes
0:28:45um
0:28:46i've uh
0:28:47i think that's
0:28:48discuss
0:28:49but
0:28:49yeah
0:28:51uh
0:28:51qualitative
0:28:52uh
0:28:54this and
0:28:55and other what this
0:28:57oh
0:28:58someone is
0:28:59so
0:29:00elements of this
0:29:01the
0:29:01the
0:29:02pulling of the day good
0:29:04and equal error rate being one point
0:29:06oh
0:29:07on the go
0:29:08it's part of that discussion
0:29:10um
0:29:11so
0:29:12i'm not going to
0:29:14start that again
0:29:15no
0:29:16uh
0:29:17i think i have something useful to say about
0:29:19average
0:29:20which
0:29:21so
0:29:22if you doing
0:29:24identification
0:29:25uh
0:29:27given a speech segment
0:29:28you told
0:29:30you're in languages
0:29:32speech segment can be in one of these in language
0:29:35then
0:29:36uh
0:29:36you also have to assume some prior
0:29:39so you can assume a flat prior
0:29:41of the of those languages that you would
0:29:44uh
0:29:47yeah
0:29:47likely
0:29:48uh
0:29:49before you look
0:29:50the speech
0:29:51that that would be
0:29:52uh
0:29:52the identification problem
0:29:54so what nist is done
0:29:57is
0:29:58that i
0:29:58uh
0:29:59so
0:30:01something this problem
0:30:02if they're in languages at in doesn't apply
0:30:06so
0:30:08the in doesn't primes is
0:30:10uh
0:30:12target language number one
0:30:13as a prior
0:30:14oh
0:30:16and
0:30:16all of the other languages
0:30:18uh
0:30:19susan between them
0:30:20uh
0:30:22oh a probability of heart
0:30:24so it's
0:30:25it's just you try
0:30:27and
0:30:28then
0:30:28you go to the next topic
0:30:30two
0:30:30you say you know this one has probability of false
0:30:33all the others
0:30:34uh
0:30:35i have a smaller probability
0:30:37then
0:30:38you missus
0:30:39D
0:30:40uh
0:30:41uh
0:30:42essentially i didn't
0:30:43cation yeah right
0:30:45given that probably
0:30:46in times
0:30:47and you and all those
0:30:50it it right
0:30:51that's the that's the secret
0:30:54um
0:30:56so
0:31:00okay
0:31:02and he to be
0:31:03to go on
0:31:04the next
0:31:04speaker
0:31:05again
0:31:07interesting
0:31:12yeah