0:00:07 | my name is tone each of are i mean assistant professor computer science i been |
---|
0:00:11 | at columbia for a little over five years now and might area of research is |
---|
0:00:15 | machine learning i direct the company machine learning lab and you have a large group |
---|
0:00:20 | of students doing several really exciting projects the machine learning lab is really about this |
---|
0:00:26 | velocity of combining complication computer science |
---|
0:00:30 | and marrying into statistics because there is so much data out there where we have |
---|
0:00:35 | not just an information age of an information overload age and the real hopes to |
---|
0:00:41 | use computers to help us make sense of the data automatically |
---|
0:00:45 | we want them to learn much with people learn and also work with the types |
---|
0:00:49 | of data that we care about so the text we read the images we see |
---|
0:00:53 | we want computers to be able understand that you to that we're generating everyday at |
---|
0:00:58 | a faster and faster rate |
---|
0:01:00 | in biology for example there's millions of variables extends of thousand genes and it's almost |
---|
0:01:06 | impossible to have someone look at this data and come up with a theory about |
---|
0:01:10 | how the biology works |
---|
0:01:12 | so increasingly side to succumbing to computer scientists and machine order saying |
---|
0:01:17 | we've made all these measurements there's just too many variables and machine learning is one |
---|
0:01:21 | of the few tools that really can work with this type of data machine learning |
---|
0:01:25 | can provide us with a network description of visualisation clustering production and so on which |
---|
0:01:32 | sciences finding very valuable these days |
---|
0:01:35 | another thing we've been working with its social network analysis and |
---|
0:01:39 | biological networks is another natural counterpart |
---|
0:01:42 | looking at networks of proteins figure out how they interact with the proteins functions are |
---|
0:01:47 | how expression levels vary over time i think machine learning is one of these |
---|
0:01:52 | unusually lucky feels in that |
---|
0:01:55 | the foundations it's working from are useful to many other disciplines |
---|
0:02:01 | are particular research is machine learning applied to really complicated problems and datasets |
---|
0:02:08 | where there is some additional structure that space so images transform in various ways if |
---|
0:02:14 | you see picture someone it rotates the latter the right or you move around somebody's |
---|
0:02:18 | face in an image you still recognizing and so we're trying to incorporate that same |
---|
0:02:23 | type of |
---|
0:02:25 | structure into all are machine learning algorithms we also design algorithms and machine learning models |
---|
0:02:31 | at work on sequences so then we can handle things like a string of text |
---|
0:02:35 | we've been able to do that very successful using machine learning by modeling the sequence |
---|
0:02:40 | structure of the text |
---|
0:02:43 | give me two documents are not tell you they were uttered by the same person |
---|
0:02:47 | or there are some subtle stylistic things between those two documents that say that this |
---|
0:02:51 | is in the same person |
---|
0:02:53 | in my mind that's the really exciting future direction from machine learning one of the |
---|
0:02:57 | areas we concentrate on my group which is how to incorporate some of this invariance |
---|
0:03:02 | we know exist special you picture a short you tell to you still recognise that |
---|
0:03:07 | actually see later on |
---|
0:03:09 | and that's kind of the key i think too many real-world problems that |
---|
0:03:15 | invariance for that particular problem |
---|