Google News

Jan. 4th, 2003 07:33 pm
[personal profile] avram

I wonder how Google News selects its headlines. For example, look at this chunk of screenshot:

[ Google News paragraph ]

What quirk of software was responsible for choosing a headline from The Hindu, an Indian paper, as the top headline, and relegating the Washington Post and Yahoo to second bananahood? Not that I mind — I like the diversity of sources — I’m just curious.

I suppose The Hindu might have more readers, India being such a populous nation, but in another section a Newsday headline is top and the NY Times headline comes in second, and I doubt Newsday has more readers. More likely it’s based on filing time, or linked-to rankings.

(no subject)

Date: 2003-01-05 10:32 am (UTC)
From: [identity profile] miramon.livejournal.com
If I understand the way that Google News works, they do it by clustering (this is based on that next-in-sequence functionality that they were testing last year). They detect that there are a number of stories on a particular subject, and so that subject becomes a group. They then see how many stories fit into the group.

The lead story is the one whose headline best fits the group; the next two are ones whose content fits the group. If you look at the Washington Post's story, it is very specifically about Pfizer inventing female sexual disorder. But the headline doesn't reflect that, so it doesn't get the lead position.
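
Here is a rough sketch of that guess in Python. It only illustrates the description above and is not Google's actual method: the bag-of-words cosine similarity, the function names, and the story fields ("source", "headline", "body") are all assumptions made for the example, and it starts from a cluster that has already been formed rather than doing the clustering itself.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic words."""
    return re.findall(r"[a-z]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two word-count Counters."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_cluster(stories):
    """Pick a lead story and up to two follow-ups for one cluster.

    stories: list of dicts with hypothetical keys
             "source", "headline", and "body".
    """
    # Word counts for the whole group (all headlines and bodies together).
    centroid = Counter()
    for s in stories:
        centroid.update(tokenize(s["headline"] + " " + s["body"]))

    # Lead story: the one whose HEADLINE best matches the group.
    lead = max(
        stories,
        key=lambda s: cosine(Counter(tokenize(s["headline"])), centroid),
    )

    # Follow-ups: remaining stories ordered by how well their BODY matches.
    rest = [s for s in stories if s is not lead]
    rest.sort(
        key=lambda s: cosine(Counter(tokenize(s["body"])), centroid),
        reverse=True,
    )
    return [lead] + rest[:2]
```

Under this scheme a story with very on-topic content but a vague headline would land in second or third place, which matches what the screenshot seems to show.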
