?

Log in

No account? Create an account

Lazyweb assignment #1 for search engine geeks

« previous entry | next entry »
Nov. 10th, 2004 | 02:04 am
mood: geekygeeky
music: Orbital - P.E.T.R.O.L.

Given:
an arbitrary list of URIs

Produce:
a google search string which will return ~90% of those URIs within the first N results


Extra credit:
Optimize for shortest possible (or practical) search string
configurable options and parameters

Link | Leave a comment |

Comments {3}

truth without proof

(no subject)

from: chronicfreetime
date: Nov. 11th, 2004 10:52 am (UTC)
Link

It's one of the first things I want to implement, assuming they hire me. The "find similar" button does a lousy job at present, I'd like it to use the content more than the URL.

Feeding the top 5-10 words by TFIDF back into a query should work pretty well, and it's very easy to compute from the index they already have.

Reply | Parent | Thread