Thursday, October 7, 2004
Bus. Std: Searching the Net -- New Technologies
My latest column in Business Standard:
One of the defining battles of the mid-1990s was between Netscape and Microsoft over control of the desktop. Netscape’s web browser threatened Microsoft’s Windows lock. Microsoft fought back with a vengeance and finally won, and a marginalized Netscape was bought by AOL. Now another battle is shaping up that could be equally defining for the future of computing. This time, the attacker is Google.
Over the past few years, Google has become the search engine of choice. As its dominance has soared, so have its ambitions: it has extended itself beyond search into other areas, both organically and via acquisitions. In recent times, the wheel has come full circle, with speculation rife that Google may be planning to launch its own browser.
There are two parts to the story unfolding as Google, Yahoo and Microsoft, along with a host of others, work to define tomorrow’s interface to the information web. The two parallel threads are building better search engines and creating richer interfaces. The search engines are the backend that solves the information overload problem, while the interfaces are the doorways to the world of content and applications. We will first discuss advances in search technologies. Later, we will look at how we will access this emerging world of “service-based computing.”
The problem of search is one of plenty. There is a lot of data on the web that needs to be converted into useful information. Search is one of the solutions to the proliferation of data that has come with the growth of the Internet. As John Battelle of Searchblog put it recently: “Search is our response to the extraordinary info-abundance in which we're all awash.” Google’s PageRank technology helped it separate the wheat from the chaff.
In a recent article on Google’s history, the Economist (Technology Quarterly, Sep 16, 2004) explained how the algorithm works: “PageRank works by analysing the structure of the web itself. Each of its billions of pages can link to other pages, and can also, in turn, be linked to. [Google’s founders] Mr Brin and Mr Page reasoned that if a page was linked to many other pages, it was likely to be important. Furthermore, if the pages that linked to a page were important, then that page was even more likely to be important. There is, of course, an inherent circularity to this formula -- the importance of one page depends on the importance of pages that link to it, the importance of which depends in turn on the importance of pages that link to them. But using some mathematical tricks, this circularity can be resolved, and each page can be given a score that reflects its importance.”
Today’s search is still in its C-prompt era and needs an upgrade. So, what will be the Windows of the search era?
In an interview with ACM Ubiquity, Ramesh Jain, professor of computer science at Georgia Institute of Technology, explains what needs to be done: “Current search engines like Google do not give me a ‘steering wheel’ for searching the Internet. The search engines get faster and faster, but they're not giving me any control mechanism. The only control mechanism, which is also a stateless control mechanism, asks the searcher to put in keywords, and if I put in keywords I get this huge monstrous list. I have no idea how to refine this list. The only way is to come up with a completely new keyword list. I also don't know what to do with the 8 million results that Google threw at me. So when I am trying to come up with those keywords, I don't know really where I am. That means I cannot control that list very easily because I don't have a holistic picture of that list. That's very important.
When I get these results, how do I get some kind of holistic representation of what these results are, how they are distributed among different dimensions…Two common dimensions that I find very useful in many general applications are time and space. If I can be shown how the items are distributed in time and space, I can start controlling what I want to see over this time period or what I want to see in that space.”
One glimpse of search innovation comes from Amazon’s A9 search engine, which is built around Google’s search results and also integrates Amazon’s own book search results. John Battelle explained A9’s approach in a column for Business 2.0: “A9 has broken search into its two most basic parts. Recovery is everywhere you've been before (and might want to go again); discovery is all that you may wish to find but have yet to encounter. A9 attacks recovery through its original Search History feature and its integrated toolbar, which tracks every site you visit. But new to this version of the site is a feature A9 calls ‘Discover,’ which finds sites you might be interested in based on your click stream and -- here's the neat part -- the click streams of others…A9 is more of a Web information management interface, with search as its principal navigational tool. [It is] betting that over time, Web users will come to recognize, then demand, that their search service not only find sites based on queries but also remember where they have been and what they have clicked on.”
A few years ago, it seemed the search game was over. Results were inaccurate, and portals were the thing to do. Google’s cutting-edge link-analysis technology resurrected an industry. Yet innovation in search is far from over. The game has just begun. In the next column, we will look at some key ideas which will define tomorrow’s search.
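The circularity in the PageRank description quoted earlier in this column (a page's importance depends on the importance of the pages linking to it) is commonly resolved by repeated iteration: start every page with an equal score, pass scores along the links, and repeat until the numbers settle. The Python below is a minimal sketch of that idea, not Google's production algorithm. The function name `pagerank`, the toy link graph, the damping factor of 0.85 and the fixed iteration count are all assumptions made for illustration.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Score pages by iterating over a link graph (illustrative sketch).

    links maps each page to the list of pages it links to.
    damping and iterations are assumed values, not taken from the column.
    """
    # Collect every page that appears as a source or as a link target.
    pages = set(links)
    for targets in links.values():
        pages.update(targets)
    n = len(pages)

    # Start with every page equally important.
    rank = {page: 1.0 / n for page in pages}

    for _ in range(iterations):
        # Every page keeps a small baseline score.
        new_rank = {page: (1.0 - damping) / n for page in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # A page splits its current score among the pages it links to.
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # A page with no outgoing links spreads its score evenly.
                share = damping * rank[page] / n
                for target in pages:
                    new_rank[target] += share
        rank = new_rank
    return rank


# Hypothetical four-page web: nothing links to "blog", most pages link to "home".
toy_web = {
    "home": ["news", "shop"],
    "news": ["home"],
    "shop": ["home", "news"],
    "blog": ["home"],
}

scores = pagerank(toy_web)
for page, score in sorted(scores.items(), key=lambda item: -item[1]):
    print(f"{page}: {score:.3f}")
```

In this toy graph "home" ends up with the highest score because three pages link to it, while "blog", which nothing links to, keeps only the baseline. The scores always sum to one, so they can be read as each page's share of total importance.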
Softbank's Son on Broadband
AlwaysOn Network has excerpts from an interview with Softbank's Masayoshi Son (from the WSJ conference 'D: all things digital'):
Thin Client News
Linux Devices has a round-up: "Thin client terminals and systems have been around for decades, and Linux terminals have been around almost as long as Linux. Thin clients were even the focus of a brief computing craze in the late 90s, when IBM, Oracle, and Sun all launched ambitious "network computer" initiatives. The fad was shortlived, however, as Sun employees reportedly began bringing their laptops to work, and using their company-issue surplus SunRay thin clients (that no one was buying) as doorstops, while Oracle's New Internet Computer (NIC) spinoff died a quiet death in June of 2003. A year later, though, thin is back in again, however. IBM, Novell, HP, and Red Hat have all been talking up Linux-based thin clients. Thin client marketshare leader Wyse has redoubled its Linux efforts. And, the Linux thin-client pioneers like SmartFlex and Neoware haven't been idle, either. Even Microsoft has joined in; see this recent whitepaper for an outline of its thin client strategy."
Jotspot and Wikis
WSJ writes:
Jeremy Zawodny adds from Web2.0:
Insider Pages
News.com writes about the newest idea from Idealab's Bill Gross:
Some good ideas in there for PIN-News...
TECH TALK: The Network Computer: The Internet OS
Tim O’Reilly discussed the idea of the Internet Operating System (built around open-source software):
Tim added recently: “I'm talking about the emergence of what I've started to call Web 2.0, the internet as platform. We heard about that idea back in the late 90s, at the height of the browser wars, but that turned out to be a false alarm. But I believe we're now starting the third age of the internet -- the first being the telnet-era command line internet, the second the web -- and the third, well, that tale grows in the telling. It's about the way that open source and the open standards of the web are commoditizing many categories of infrastructure software, driving value instead to the data and business processes layered on top of (or within) that software; it's about the way that web sites like eBay, Amazon, and Google are becoming platforms with rich add-on developer communities; it's about the way that network effects and data, rather than software APIs, are the new tools of customer lock-in; it's about the way that to be successful, software today needs to work above the level of a single device; it's about the way that the Microsofts and Intels of tomorrow are once again going to blindside established players because all the rules of business are changing.”
If there is one candidate to build the Internet OS, it is Google.
Tomorrow: Google OS/PC/Browser
Hi Rajesh,
A very interesting article. Just today I spotted an item on Slashdot about Google Print, Google's answer to A9. But I find these are peripheral to search; sometimes a search engine like Vivisimo (http://www.clusty.com) can give better search interpretations.
Santoshkumar
Posted by Santosh Kumar
Google is also harboring ambitions to rule the space that Amazon has been ruling for the last few years. I was reading yesterday that Google has quietly launched a service that will help search books across publishers, at a trade show for the publishing industry in Europe.
While one way to look at this is that Google recognizes, and so is challenging, Amazon's prominence in this space, I also feel that at a broader level Google wants to link (and integrate) search for information across real (books) and virtual (Internet) resources.
A
Posted by Ashu
While Google has forayed into Amazon's turf and made books viewable (a couple of pages back and forth), its real challenge lies in making the online book non-stealable. The current DRM, which doesn't let you save the book as an image, is pretty weak and can be easily broken with some JavaScript/browser trickery. What the brains at Google have to come up with is a better kind of DRM solution; otherwise no publisher in their right mind is going to send them books to be scanned and made searchable/viewable.
Posted by Anand Jain