First Test Of Computer Jeopardy Player Goes Well; Watson Beats Mere Humans

from the show-off dept

Last month, we wrote that the IBM computing project Watson was ready to take on real Jeopardy contestants in February of this year (just a few years after the project first came to life). While the big test isn’t until Valentine’s Day, apparently they had a dry run, and things are looking pretty good for Watson, who beat Jeopardy champions Ken Jennings and Brad Rutter. It wasn’t a complete domination. Apparently Watson just narrowly edged out Jennings, though Rutter couldn’t keep up. Of course, between now and the real test, Watson can be tweaked. Jennings’ and Rutters’ brains are pretty much set, and I’d imagine that their ability to cram more useless trivia in their brains is outmatched by Watson.

Filed Under: , ,
Companies: ibm

Rate this comment as insightful
Rate this comment as funny
You have rated this comment as insightful
You have rated this comment as funny
Flag this comment as abusive/trolling/spam
You have flagged this comment
The first word has already been claimed
The last word has already been claimed
Insightful Lightbulb icon Funny Laughing icon Abusive/trolling/spam Flag icon Insightful badge Lightbulb icon Funny badge Laughing icon Comments icon

Comments on “First Test Of Computer Jeopardy Player Goes Well; Watson Beats Mere Humans”

Subscribe: RSS Leave a comment
44 Comments
Chronno S. Trigger (profile) says:

Re: Re: Re:

A 6th grader can find the question with a Google search, but that 6th grader has the intelligence to take that answer and turn it into a keyword search, taking into account the word play, and sifting threw the many, many responses for the question that fits.

Computers aren’t good at figuring out what information can be ignored and what words are being used in ways that aren’t in the dictionary.

Marcus Carab (profile) says:

Re: Re: Re:

A 6th-grader’s brain comes pre-installed with the near-magical capability of abstraction. THAT’S what the challenge was here. Perfect recall is no huge feat – which is why even people like Jennings are only really impressive in a very limited way. But a computer learning to abstract and recognize complex patterns in order to answer natural english questions – that is tough

vivaelamor (profile) says:

Re: Re: Re:

“I do not really consider this being a big thing. This is like saying a 6th grader beat Ken Jennings using only his skills to type the question in Google.”

Another point worth making is that Watson is completely offline. While they feed it presumably more books than any human could read in their lifetime, it cannot just rely on the question having already been answered in an easy digest form on Google.

nasch (profile) says:

Re: Re:

Anything already invented is just a program, artificial intelligence is always whatever we don’t know how to do yet. 20 years ago if you had described a computer that can beat a Jeopardy champion without assistance, that would probably be thought of as AI. Now to be AI it has to, I don’t know, be able to both drive a car and cook a meal. And after that’s invented, it won’t really be AI until it can understand human emotions. And so on. 😉

Richard (profile) says:

From My Introductory AI Lecture this morning

Before project:

“We will construct a program/robot that will perform task X (e.g. play Chess). The resulting program/robot will thus have artificial intelligence – since humans use intelligence to perform task X.”

After project

Scenario 1

“We failed”

Scenario 2

“We succeeded – however on closer examination it turns out that the method our machine uses is really some form of brute force algorithm that can’t really be called intelligent. It seems that you don’t really need intelligence to perform task X after all.”

Anonymous Coward says:

Re: Re: Re:

(and they installed preprogrammed answers ahead of time designed to answer those pre-planned questions. So it’s kinda like allowing someone to study a test bank ahead of time and then testing how well that person knows the answers to the questions in the test bank. Very little conceptualization required).

Anonymous Coward says:

Re: Re: Re:2 Re:

I’m not saying the above is what happened, the above is just an oversimplification of how a computer can beat a person.

My point is that if the computer has been asked these questions before and the answers that the computers algorithms will provide are known in advanced (and the computer has been tweaked to answer these questions correctly), and the people contending against the computer have never been asked these questions, then it’s not as surprising that the computer won. Sure, that still maybe a slight oversimplification, but the point is that the computer will naturally be more familiar with and tuned to answer the specific types of questions that its programmers will tend to ask along with being able to understand the specific wording that the programmers will tend to use (and respond with the expected wording).

Marcus Carab (profile) says:

Re: Re: Re:3 Re:

You are making this all up. IBM has no idea what questions the computer will face – sure it has some concept of the type of question, just like any human practicing to be on Jeopardy. But the questions Watson faces are new and unexpected.

And the whole concept of a “type” or “nature” of question is exactly the point here. Humans can abstract and recognize the “type” of a bunch of complex and varying things like trivia questions. To a computer, each one is a completely unique string of characters that can only be parsed with strict logical conditions – same goes for every single one of the facts it has in its database of knowledge.

This is obvious even at basic levels of computing: spellcheckers work well, grammar checkers do not. That’s because spelling is a (for the most part) strictly defined set of rules – the computer simply checks each word against its dictionary, seeking an exact match. Grammar, on the other hand, is highly complex with varying degrees of strictness and countless structural options: there are “rules” but they are not nearly as simple or clear, always being full of exceptions and intricate conditions. Now, even though most people cannot explain all the rules of grammar, we instinctively know when a lot of things are right or wrong just based on how it “sounds” to us – and we can’t even explain what we mean by that, or how that process works in our head. Computers have no such skill.

vivaelamor (profile) says:

Re: Re: Re:4 Re:

‘And the whole concept of a “type” or “nature” of question is exactly the point here. Humans can abstract and recognize the “type” of a bunch of complex and varying things like trivia questions. To a computer, each one is a completely unique string of characters that can only be parsed with strict logical conditions – same goes for every single one of the facts it has in its database of knowledge.’

Two terms worth throwing in here for anyone wanting to learn more about the sort of processes I assume Watson uses: fuzzy logic and neural network.

Anonymous Coward says:

This is no real contest I’m afraid. This machine is so different from Big Blue that competed in chess. This is simply a search and seizure operation, you ask a question and it searches it’s enormous database and finds the answer. For all we know it may just Google it to get the answer. The computer has an unfair advantage unless you give the humans the same access to the data.

Marcus Carab (profile) says:

Re: Re:

It’s really not that simple. I can see how it seems that way at first, but in fact this was a massive AI challenge. It’s not having the knowledge that is difficult (and no, it has no internet access) – the challenge is parsing the wording of the Jeopardy questions to figure out what the answer is. Remember: computers don’t understand English. They just fake it, and usually not very well.

There are lots of articles out there explaining why Watson was a very real AI challenge – and no they are not all just articles by IBM. You should definitely look more into it because it’s actually pretty fascinating and it truly is a major breakthrough, if not in genuine “artificial intelligence” then at least in the natural language processing (which is one of the core building blocks of artificial intelligence)

Anonymous Coward says:

Re: Re: Re:

If I chose the questions, or set of questions to be asked, ahead of time then even I can create a computer program that can answer each question flawlessly and beat almost anyone who is being exposed to those questions for the first time. That’s hardly impressive. I will just write a test bank of questions, program them into the computer, program the answers into the computer, and ask those pre – programmed questions.

Were the people who created the program independent of those who chose the questions? Were those who chose the questions given any guidelines as to what kinds of questions should be asked, what kinds of questions the computer has answers to, and how the questions should be worded? This test run was done at IBM’s headquarters and so it was done in a controlled environment. Practically any beginner programmer can create an excellent Jeopardy contender in a controlled environment if the set of questions and answers were predetermined. We must know how prepared the computer was to answer the set of questions that were being asked ahead of time in opposed to how prepared it was to answer any random set of questions.

Marcus Carab (profile) says:

Re: Re: Re: Re:

Where does this whole “pre-chosen questions” concept come from? You are just making that up.

That’s not what happened. And yes, the people creating the program ARE independent. Maybe you didn’t notice but this is an IBM computer competing at Jeopardy – IBM does not own Jeopardy and Jeopardy has no interest in helping them win. Besides, both sides have made it clear that the challenge is not altered in any way from a typical Jeopardy match – so unless you are accusing everyone involved of lying to our faces, I have no idea what you are talking about.

This is a genuine challenge, but you for some reason insist on believing it’s a massive cheat. What has given you this idea?

Anonymous Coward says:

Re: Re: Re:2 Re:

Hmmm…may I interject here?

Jeopardy was known to fix their games for a long time, this is history not story, so I would be careful not believing they could just lie to our faces they would if they could get away with it LoL

Other then that I doubt IBM would want that, because that system would be eventually be open to scrutiny and the truth would come out and destroy their good reputation that helps them sell billions in hardware. That is why I don’t believe they are cheating here, they don’t do that kind of thing because it would cost them a lot of money.

Marcus Carab (profile) says:

Re: Re: Re:3 Re:

Hmm – I didn’t know that about Jeopardy. But definitely looking at this situation there are no benefits to either side for cheating: Jeopardy would rather have a human win, because the humans sort of represent them in this rivalry – and a computer beating their game would trivialize it in a lot of people’s eyes. Meanwhile IBM, as you say, gains nothing by pretending to have made a breakthrough in AI, except perhaps a momentary stock bump before they get found out and torn to shreds.

Anonymous Coward says:

Re: Re: Re:4 Re:

“I didn’t know that about Jeopardy.”

Many game shows have been known to do that, not just Jeopardy, though that wasn’t my point here. My point here, from the article

“was a test run this morning at IBM Research’s headquarters in preparation for a televised weekend challenge”

From my understanding, this wasn’t done by Jeopardy, it was done by IBM. To what extent were the selected questions created by people not involved in the development of the computer hardware, software, and database info and to what extent did those people creating the questions receive guidance as to how to word the questions, how accepted answers should be worded, and what kinds of questions to ask and which ones to steer away from.

Anonymous Coward says:

Re: Re: Re:5 Re:

(and guidance as to what expected answers to expect so that questions can be biased towards ones with those answers). I’m not saying it’s necessarily fixed, just that those developing the system will naturally have a tendency to ask certain kinds of questions, even if they’re not conscious aware of it.

vivaelamor (profile) says:

Re: Re: Re:6 Re:

“I’m not saying it’s necessarily fixed, just that those developing the system will naturally have a tendency to ask certain kinds of questions, even if they’re not conscious aware of it.”

What do you base that on? If their goal is to improve the system then they may well ask it questions that it is less likely to get right, a completely opposite bias to the one you assume.

vivaelamor (profile) says:

Re: Re: Re:5 Re:

“From my understanding, this wasn’t done by Jeopardy, it was done by IBM. To what extent were the selected questions created by people not involved in the development of the computer hardware, software, and database info and to what extent did those people creating the questions receive guidance as to how to word the questions, how accepted answers should be worded, and what kinds of questions to ask and which ones to steer away from.”

Why would IBM cheat on the not actual Jeopardy test? It’s obvious why they wouldn’t cheat: they can test it against the actual competitors and tweak the systems in preparation for the real test. Are you seriously suggesting that they’re not confident of their chances in the real test and are using this trial run to make it look like they did well?

Anonymous Coward says:

Re: Re:

Yah! bug you neglect the concept behind it, you take for granted the human ability to do so, is not that simple to make a computer do it, because we don’t understand how we work.

If you doubt please try to make a search engine that understands the meaning of words and can keep track of the many many double meaning things have.

Ryan Diederich says:

Much more complex...

The programming here is much more complex than many realize. This thing has to be able to define the categories, and then figure out what the definition of the category means to each question.

For example, a question might read: 1812

The AI would have to remember that the category was “wars in history” and then apply that to its database search.

Of course that one was simple, there are many more complex ones that I cant think of.

I do remember one category that was, in other words, “Poets with the same letters for their first and last name”

Now, unless you were expecting this category, how on earth could you program something to realize what that means.

Jason says:

More difficult than that would be the intuition type categories like Rhyme Time and Before and After. In these cases you have to get the idea of how the answers work before you can start answering correctly.

What I’m wondering: Are they trying to anticipate common categories and programming this type of information into the computer or simply relying on some kind of intuition algorithm (?).

vivaelamor (profile) says:

Re: Re:

“What I’m wondering: Are they trying to anticipate common categories and programming this type of information into the computer or simply relying on some kind of intuition algorithm (?).”

I’m guessing that if they needed to then they could a combination. I think it is more likely that such a step would be redundant because such a thing should be easily and more effectively learned by the machine anyway, e.g. by inputting previous actual questions. If every question that had the word ‘dig’ in the title turns out to be about archeology then the computer will learn that.

Add Your Comment

Your email address will not be published. Required fields are marked *

Have a Techdirt Account? Sign in now. Want one? Register here

Comment Options:

Make this the or (get credits or sign in to see balance) what's this?

What's this?

Techdirt community members with Techdirt Credits can spotlight a comment as either the "First Word" or "Last Word" on a particular comment thread. Credits can be purchased at the Techdirt Insider Shop »

Follow Techdirt

Techdirt Daily Newsletter

Ctrl-Alt-Speech

A weekly news podcast from
Mike Masnick & Ben Whitelaw

Subscribe now to Ctrl-Alt-Speech »
Techdirt Deals
Techdirt Insider Discord
The latest chatter on the Techdirt Insider Discord channel...
Loading...