Google follows Yahoo! in personalizing search

Google hurries up after Yahoo! yesterday MyWeb2.0 announce: Google Relaunches Personal Search - This Time, It Really Is Personal.
Huh? Google gives an example (not yet posted live) that says:
For the query [bass], Google Personalized Search may show the user results about the instrument and not the fish if that person was a frequent Google searcher for music information

How would Google know you are a frequent music information searcher? It could monitor the types of queries you do and use various methods to tell if you seem to be searching for music information often. But another method — and one using technology Google has already has demonstrated — is to monitor what you click on in the results.

(FYI, a Google patent on personalization based on bookmarks that recently came to light is covered in this SEW Forums thread and in great depth in this Cre8asite thread. Another recently discussed patent also covers things like using clickthrough measurements to refine results. In addition, Google has personalization technologies and patents from past acquisitions, such as Outride).

Patents on personalization based on bookmarks?!? Google recording all the links you have ever clicked and using them?!?
I think I’m going to switch from Google to Yahoo!, just to foster diversity. Yahoo! is more Flickry and more reassuring to me at the moment.

Software Patents WebStrike: “close” your web site to Save Europe from Software Patents!

My homepage is closed. I’m participating to the Web Demo Against Software Patents. Please, do the same!
On 6th July in Strasbourg the European Parliament could Save Europe from Software Patents
The Software Patents Directive, as approved by the European Council of Ministers, would codify US-style Software Patents in the European Union.
If that happens, software developers will no longer own what they write and can be sued for selling or distributing their own software.
If you don’t inform your parliament, mega-corporations are doing the job for you: “The European Parliament is filled with lobbyists of Microsoft, Eicta, CompTIA and so on. There are 30 to 40 lobbyists permanently roaming the halls.” (in Eweek, 21 June (http://www.eweek.com/article2/0,1759,1829955,00.asp))

What you can do
1. Participate in the Web Demo until the vote of 6th of July
2. Please send nice faxes or make phone calls to your representatives in the European Parliament (http://www.ffii.org/~gibuskro/meplist/) and ask them to follow the FFII voting recommendations (http://europarl.ffii.org/amendments.en.html). You can also ask them to follow the rapporter Rocard.

Live8 and Africans


Ethan Zuckermann ponders about Live8:
But in the age of citizen journalism, it’s pretty easy to hear what
smart, opinionated Africans think about Live 8 directly from their blogs. I just did a roundup of African bloggers writing about Live 8 over at Global Voices.
You may be unsurprised to discover that, generally speaking, there’s less enthusiasm for Live 8 on the continent than there is in the US or UK.
While it’s admirable that thousands of bloggers have added
to their pages to promote Live 8, to support African debt relief or to try to revive Bob Geldof’s career. But it would be a damn sight more useful and transformative if bloggers would go a step further and start reading some African bloggers… perhaps starting with some of the folks who are justifiably skeptical about the value of yet another rock concert. Allow me to recommend Thinker’s Room’s “Live Aid? Please!”, Sokari Ekine’s “Live 8419″ or Gerald Caplan’s brilliant piece in Pambazuka.

SocialSearch: risk of moving from “tyranny of the majority” to “the daily me”

On Yahoo!Blog, while presenting its new MyWeb2.0:
The answer a web search engine delivers is what it believes is the correct answer for the majority of users – often referred to as “the tyranny of the majority”. For example, when you search for ‘apple’, the first result on most search engines is Apple Computer. But you may have been searching for information about the fruit or Apple Records.
This is a point I’m making since some years and so I totally agree that this is a problem of current search engines and I totally agree that considering personal trust networks of users is the solution to go (actually this is my my PhD research topic).
But I want also to point out, as I already did some time ago, that on the other extreme (total personalization) there is another, maybe bigger, risk: “the daily me”.
If you only see web sites, opinions, movies, etc of people you already agree with, you will never ever meet new, unexpected points of view, you will never ever need to argue your points with someone that thinks different (and possibly change your mind, at least a little bit), you will simply exacerbates your opinions, you will end up not even being able to understand the language used by people that are not in your “community” of like-minded friends!
If you are an anarchist speaking/reading only other anarchists, you will strengthen your opinions, they will become more extreme. Or if you are a catholic orthodox, or … The same is true for every group: liberals watching and reading mostly or only liberals; moderates, moderates; conservatives, conservatives; neo-Nazis, neo-Nazis. The resulting divisions run along many lines–of race, religion, ethnicity, nationality, wealth, age, political conviction, and more. Most whites avoid news and entertainment options designed for African-Americans. Many African-Americans focus largely on options specifically designed for them. So too with Hispanics

This will produce extremism and fragmentation of society and could have terrible, violent consequences.
The great book of Cass Sunstein Republic.com analyses this risk and more importantly tries to suggest a range of potential reforms to correct current misconceptions and to improve deliberative democracy and the health of the American republic.

(…)
First, people should be exposed to materials that they would not have chosen in advance. Unplanned, unanticipated encounters are central to democracy itself. Such encounters often involve topics and points of view that people have not sought out and perhaps find quite irritating. They are important partly to ensure against fragmentation and extremism, which are predictable outcomes of any situation in which like-minded people speak only with themselves. I do not suggest that government should force people to see things that they wish to avoid. But I do contend that in a democracy deserving the name, people often come across views and topics that they have not specifically selected.

Second, many or most citizens should have a range of common experiences. Without shared experiences, a heterogeneous society will have a much more difficult time in addressing social problems. People may even find it hard to understand one another. Common experiences, emphatically including the common experiences made possible by the media, provide a form of social glue. A system of communications that radically diminishes the number of such experiences will create a number of problems, not least because of the increase in social fragmentation.

I think it is time that everyone of us (especially those involved in creating personalized services, and hence in this case, especially Yahoo!) should start thinking about this problem before we are too ahead in the future. What do you think?

Social Search via Trusted Friends (by Yahoo!)

Via Many2Many, I learn that Yahoo has a new gift for us: My Web 2.0 BETA - A Social Search Engine.
On Yahoo! Search Blog there is an introduction to the new service:
Almost two years ago, one of our engineers was interested in buying a plasma TV and tried using web search to find a good site for reviews — a quick search revealed that there were hundreds of sites offering to educate him on plasma TVs, yet short of visiting all the sites, it was difficult to figure out which site exactly was the ‘best’ site. So he did what millions of people do every day – asked a friend, who recommended two excellent sites for plasma TV reviews. He never ended up buying a TV (things just got too busy with search), but this was the moment of inspiration that lead us to build the product we are introducing today – a social search engine that enables people to search the expertise of their friends and community. [Read the rest on Yahoo! Search Blog]
All this trust-enhanced search and filtering and recommendations is what I’m trying to do with my PhD and I’m happy it is really taking off. About technology and metrics, they propose MyRank, as a successor of global trust metrics such as PageRank or the more recent TrustRank.
I didn’t play with it for now. And I really hope that Yahoo! is going to embrace the Open Web with this new service by giving Open API and export facilities (at the moment it seems so and I’m sure Flickr guys are going to suggest the right moves about that).
Ken Norton is collecting reactions from the blogosphere, or you can monitor who is speaking about My Web 2.0 on Technorati or BlogLines.
Otherwise read My Web 2.0 Blog (first post is by our great flickresque friend Caterina Fake).

Don’t yawn!

Don’t yawn game. Tell me if you yawned or not (via delicious/popular).

Presentation on “Trust in Recommender Systems: an historical overview and recent developments”

I was invited by Stefano Mizzaro to give a lecture in his course in “Web Information Retrieval”. I spoke about “Trust in Recommender Systems: an historical overview and recent developments”. It was a lot of fun (at least for me). And I thought I could share the slides with you. They are in OpenOffice .sxi format (it is an open format, so if you program does not read a commonly used open format, you probably better change it). They are released under a Attribution-ShareAlike Creative Commons licence. This means that if you want to use them you just have to give credit to me and re-share your slides under the same licence. If you don’t want to re-share your derivative work under the same Creative Commons licence, you are still free, free of not using them. Enjoy.

My suggestion to Google: getoutfoxed(’s author)

Google, do hire Stan before Yahoo! does it. Stan is the author of “Outfoxed - Personalize your internet.” I didn’t play with the code yet (seems a Linux version is not yet ready at the moment, but on the way). Yes, the code is open source (Mozilla Public Licence), sweet! Anyway, the detailed description is fantastic! It is a bit like what I want to do for my PhD thesis. The difference? Stan did it! Check the site: it has a lot of interesting pages such as The Outfoxed Idea (A collection of thoughts on the theoretical aspects of Outfoxed, and the whole idea of using social networks for metadata distribution). Or at least the page A Third Phase of Internet Search in which Stan pictiorally shows the 3 phases: Naive trust –> PageRank and inferred quality –> Social networks to determine subjective quality
Continue Reading

GreaseMonkey on Trenitalia.com

Trenitalia.com (Italy’s public railways) has some links that works only on IExploder. Few weeks ago you could do nothing but sending tons of email asking Trenitalia.com to support standard (you can also sign a petition for Making Internet Explorer Standards Compliant and hope).
BUT NOW you can GreaseMonkey it! [you need the great Firefox browser] Install the Trenitalia Link Fixer script that fixes wrong links in Trenitalia website.
(via blackbirdblog)

GreaseMonkey is the real Semantic Web (and now works on HospitalityClub)

GreaseMonkey is an extension for Firefox that allows you to totally (and easily) change the layout of any received web page. Don’t like the color of the banner of that_site.com? You can change it! Do you prefer to have the login link on the_other_site.org on the right? You can place it wherever you want! While visiting the page of a certain book on Amazon.com, do you want to see the prices other sites ask for the same book (with this information embedded on “original” Amazon page)? You can do it (with BookBurro extension)! Want to hide forever every Google AdSense ad? You can do it! You find hundreds of scripts (for hundreds of different sites) over at GreaseMonkey UserScripts wiki or you can easily create yours (as I did, see the end of this post).
Oh yes, this will blow up your business model and “any kid with a bright idea and a knack for DHTML can create a new interface for your site, and it will probably be better than yours.
And yes, this is much much more real (and useful) than all the Semantic Web you listen about at conferences (with tons of papers and tons of highly funded programs that, at least at the moment, produces almost nothing you can use and play with; if I’m wrong, use the comment to point out interesting stuff).
Anyway, I played a bit with GreaseMonkey. I recommend you diveintogreasemonkey by Mark Pilgrim and I suggest you to follow it step by step (this is faster than trying to jump to what you need because you will jump back to understand that what you skipped was important).
And eventually, I created 2 GreaseMonkey scripts for HospitalityClub, that I think can save me a lot of time in using the site. I used HospitalityClub for finding hospitality in Trieste when I was attending the School on Networks (thanks truesmile and inquis), I used it in order to find hospitality in Pittsburgh where I’ll be for the AAAI conference (thanks roder) and yesterday I wanted to use it for finding hospitality for my (short) holidays in Italy [not going to tell where]. The problem with HospitalityClub is that the interface is not too usable. My usual use case is the following: I search all the people offering hospitality in the place where I want to go, and I send to all of them the same request. This requires visiting the list of users, clicking on every username to go to her userpage and, on the userpage, click on “send message to this user” that leads to a new page, then copying my name in a field, my passport number in another field, the request text in a text area and push Submit. All these steps must be done for all the users!
So I created a GreaseMonkey extension that add a link near every username: the link allows to go directly to the “send message” page.
      [ script: hospitalityclub_addSendMsgLink.user.js ]
And I created another extension that prefill the values in the “send message” page with the default ones (my username, my passport number, the request message).
      [ script: hospitalityclub_defaultValuesInMsg.user.js ]
In this way you just have to push Submit. It would be possible to push Submit automatically with the extension but I wanted to keep some control … interestingly GreaseMonkey gives you so much power that then your small brain is no more able to manage it. I mean, for example, I have at least 4 extensions that modify google.com pages and I’m no more able to tell which extension inserts what in which cases… this is something I need to think a little bit more about.
Anyway the 2 extensions are released under GPL (software that gives you freedom) so you are free to play with them, free to study them and free to modify them. Enjoy!