
07 March 2009

Listed below are links to weblogs that reference How to answer "Where is Madingley?":

Comments

Nomlas

I would have thought that this is a question whose answer depends on the context of the person asking the question. Labelling English counties as interesting is good for someone in England, or maybe an ex-pat Brit (who still has a knowledge of England). But for someone in China, English counties are probably irrelevant.

If you had some way of knowing about the user, then you could tailor the answer to them. For instance, you could provide an appropriate amount of context to describe answers, and also rank answers so that, e.g., the nearest place comes first. Could this be an optional feature of TK accounts?
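
As a rough illustration of ranking answers by proximity, here is a minimal sketch in Python. It assumes the asker's latitude and longitude are already known (for example from an account setting); the `Place` type, the function names, and the approximate coordinates are all made up for the example and say nothing about how True Knowledge actually stores places.

```python
from dataclasses import dataclass
from math import asin, cos, radians, sin, sqrt


@dataclass(frozen=True)
class Place:
    name: str
    lat: float  # degrees
    lon: float  # degrees


def haversine_km(lat1: float, lon1: float, lat2: float, lon2: float) -> float:
    """Great-circle distance in kilometres, treating the Earth as a sphere."""
    phi1, phi2 = radians(lat1), radians(lat2)
    dphi = phi2 - phi1
    dlam = radians(lon2 - lon1)
    a = sin(dphi / 2) ** 2 + cos(phi1) * cos(phi2) * sin(dlam / 2) ** 2
    return 2 * 6371.0 * asin(sqrt(a))


def rank_by_distance(candidates, user_lat: float, user_lon: float):
    """Return the candidates sorted so the one nearest the user comes first."""
    return sorted(
        candidates,
        key=lambda p: haversine_km(user_lat, user_lon, p.lat, p.lon),
    )


# Hypothetical candidates with approximate coordinates.
CANDIDATES = [
    Place("Cambridge, Cambridgeshire", 52.21, 0.12),
    Place("Cambridge, Gloucestershire", 51.73, -2.36),
    Place("Cambridge, Massachusetts", 42.37, -71.11),
]

# An asker somewhere near Boston.
ranked = rank_by_distance(CANDIDATES, 42.36, -71.06)
print([p.name for p in ranked])
```

Sorting on plain distance is the simplest option; a real ranking would probably also weigh how well known each place is.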

Nicholas

You need to up the number of facts by two or three orders of magnitude. Hand-entering them is simply not a feasible way of proceeding. I recommend you write two sets of automated web crawlers: first, one that targets specific sites such as Wikipedia and uses knowledge about things like infoboxes to import lots of data automatically; second, a more general crawler that spots common patterns of fact assertion on any website with a high trust value (PageRank?).

You can exclude some assertion patterns too, such as those that begin with "your mother is". Assertions whose usage is suspect could be held in a clearing house until they can be verified, and trivial ones can be filtered out automatically: "[Nicholas Shanks] would like [a pepperoni pizza for lunch] was true from 9 March 2009 onwards" is a pretty useless fact that you might find online.
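
A minimal sketch of that triage step, in Python. The pattern lists, the `trust` score, and the 0.5 threshold are all assumptions invented for the example; a real crawler would need far richer rules than two regular expressions.

```python
import re

# Hypothetical patterns: assertions matching these are dropped outright.
EXCLUDED_PATTERNS = [
    re.compile(r"^\s*your mother is\b", re.IGNORECASE),
]
# Hypothetical patterns for trivial, short-lived "facts".
TRIVIAL_PATTERNS = [
    re.compile(r"\bwould like\b", re.IGNORECASE),
]

# Assumed cut-off: sources scoring below this go to the clearing house.
TRUST_THRESHOLD = 0.5


def triage(assertion: str, trust: float) -> str:
    """Classify a crawled assertion as 'reject', 'hold' or 'accept'.

    `trust` is an assumed score in [0, 1] for the source website.
    """
    if any(p.search(assertion) for p in EXCLUDED_PATTERNS + TRIVIAL_PATTERNS):
        return "reject"
    if trust < TRUST_THRESHOLD:
        return "hold"  # keep in the clearing house until verified
    return "accept"


print(triage("Your mother is a hamster", trust=0.9))
print(triage("Madingley is in Cambridgeshire", trust=0.2))
print(triage("Madingley is in Cambridgeshire", trust=0.9))
```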

Other importers would also be useful, such as GEDCOM and FOAF for people/relationships/genealogies, and a general RDF ontology parser/importer for accumulating random data. RDF triples are well suited to how your system works.
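
To show what importing triples might look like, here is a deliberately tiny Python sketch that reads lines in a simplified N-Triples style (`<subject> <predicate> <object> .`). It handles only IRIs and plain quoted literals, with no language tags, datatypes or blank nodes, and the `example.org` identifiers are placeholders. A real importer would more sensibly use a full RDF library than a regular expression.

```python
import re
from typing import NamedTuple, Optional


class Triple(NamedTuple):
    subject: str
    predicate: str
    obj: str


# Simplified pattern: <iri> <iri> (<iri> | "plain literal") .
_LINE = re.compile(
    r'^\s*<([^>]*)>\s+<([^>]*)>\s+(?:<([^>]*)>|"((?:[^"\\]|\\.)*)")\s*\.\s*$'
)


def parse_line(line: str) -> Optional[Triple]:
    """Parse one simplified N-Triples line; return None if it does not match."""
    m = _LINE.match(line)
    if m is None:
        return None
    subject, predicate, iri, literal = m.groups()
    return Triple(subject, predicate, iri if iri is not None else literal)


def parse_document(text: str):
    """Collect every line that parses, skipping comments and anything else."""
    triples = []
    for line in text.splitlines():
        triple = parse_line(line)
        if triple is not None:
            triples.append(triple)
    return triples


SAMPLE = """
# placeholder data, not real True Knowledge facts
<http://example.org/Madingley> <http://example.org/isLocatedIn> <http://example.org/Cambridgeshire> .
<http://example.org/Madingley> <http://example.org/name> "Madingley" .
"""

for t in parse_document(SAMPLE):
    print(t)
```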

Beth

We already do automatically import facts. If you look through the recent activity log you'll see lines like

15:58 yesterday true knowledge Added 34603 facts sourced from Freebase about human beings

I sure didn't type all those humans in myself!

Martin Griffies

I'd guess that you need a reference to the computer asking the question and its origin or IP address.

Thus: to a person in Cambridge (CB, UK), Madingley is just a few miles north.

To someone in Cambridge (Glos) it's in Cambridgeshire, and
to someone in Cambridge (Mass), Madingley is in England.
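
One way to sketch this idea in Python: describe the place by the broadest region that the asker does not already share with it. The containment chains below (broadest region first) are written by hand for the example, and the function name is made up; working out the asker's chain from an IP address is assumed to happen elsewhere.

```python
from typing import Optional, Sequence


def first_unshared_region(
    place_chain: Sequence[str], asker_chain: Sequence[str]
) -> Optional[str]:
    """Return the broadest region containing the place that the asker is not in.

    Both chains run from the broadest region to the place itself.
    Returns None when the asker shares every containing region, in which
    case a relative description ("a few miles away") is more useful.
    """
    # Skip the last element: that is the place itself, not a containing region.
    for i, region in enumerate(place_chain[:-1]):
        if i >= len(asker_chain) or asker_chain[i] != region:
            return region
    return None


MADINGLEY = ["England", "Cambridgeshire", "Madingley"]

ASKERS = {
    "Cambridge (CB, UK)": ["England", "Cambridgeshire", "Cambridge"],
    "Cambridge (Glos)": ["England", "Gloucestershire", "Cambridge"],
    "Cambridge (Mass)": ["United States", "Massachusetts", "Cambridge"],
}

for label, chain in ASKERS.items():
    region = first_unshared_region(MADINGLEY, chain)
    if region is None:
        print(f"{label}: describe Madingley relative to the asker's own location")
    else:
        print(f"{label}: Madingley is in {region}")
```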
