<?xml version="1.0" encoding="UTF-8"?>
<!-- generator="wordpress/1.5.1" -->
<rss version="2.0" 
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
>

<channel>
	<title>Sci7 Ltd</title>
	<link>http://www.sci7.com/cms</link>
	<description>Internet marketing and data-mining. Bioscience specialists.</description>
	<pubDate>Wed, 03 Nov 2010 14:02:07 +0000</pubDate>
	<generator>http://wordpress.org/?v=1.5.1</generator>
	<language>en</language>

		<item>
		<title></title>
		<link>http://www.sci7.com/cms/67/67.html</link>
		<comments>http://www.sci7.com/cms/67/67.html#comments</comments>
		<pubDate>Wed, 03 Nov 2010 14:02:07 +0000</pubDate>
		<dc:creator>Webmaster</dc:creator>
		
	<category>News</category>
		<guid>http://www.sci7.com/cms/67/67.html</guid>
		<description><![CDATA[
]]></description>
			<content:encoded><![CDATA[
]]></content:encoded>
			<wfw:commentRSS>http://www.sci7.com/cms/67/67.html/feed/</wfw:commentRSS>
	</item>
		<item>
		<title>Good Practice Identified -  RightMove.co.uk</title>
		<link>http://www.sci7.com/cms/28/good-practice-identified-rightmovecouk.html</link>
		<comments>http://www.sci7.com/cms/28/good-practice-identified-rightmovecouk.html#comments</comments>
		<pubDate>Mon, 17 Aug 2009 07:05:56 +0000</pubDate>
		<dc:creator>Webmaster</dc:creator>
		
	<category>Resources</category>
		<guid>http://www.sci7.com/cms/28/good-practice-identified-rightmovecouk.html</guid>
		<description><![CDATA[]]></description>
			<content:encoded><![CDATA[]]></content:encoded>
			<wfw:commentRSS>http://www.sci7.com/cms/28/good-practice-identified-rightmovecouk.html/feed/</wfw:commentRSS>
	</item>
		<item>
		<title>Sci7 on Google and government documents</title>
		<link>http://www.sci7.com/cms/48/sci7-on-google-and-government-documents.html</link>
		<comments>http://www.sci7.com/cms/48/sci7-on-google-and-government-documents.html#comments</comments>
		<pubDate>Sun, 18 Dec 2005 00:39:01 +0000</pubDate>
		<dc:creator>Webmaster</dc:creator>
		
	<category>Resources</category>
		<guid>http://www.sci7.com/cms/48/sci7-on-google-and-government-documents.html</guid>
		<description><![CDATA[	Mary (Maggie) Farrell, Dean of Libraries at the University of Wyoming USA recently wrote an editorial in Government Information Quarterly (Vol. 22 (2005) 143145)) entitled : &#8220;Google and government documents&#8221;. 
	The editorial discusses Google Print, and its aims to digitise the collections of major libraries. Google&#8217;s efforts are seen as  more significant than those [...]]]></description>
			<content:encoded><![CDATA[	<p>Mary (Maggie) Farrell, Dean of Libraries at the University of Wyoming USA recently wrote an editorial in Government Information Quarterly (Vol. 22 (2005) 143145)) entitled : &#8220;Google and government documents&#8221;. </p>
	<p>The editorial discusses <a href="http://print.google.com" target="_blank" rel="tag">Google Print</a>, and its aims to digitise the collections of major libraries. Google&#8217;s efforts are seen as  more significant than those which have gone before due to the scale of Google&#8217;s effort as well as technology and funding which are in a completely different class to what has gone before. Google&#8217;s work is also viewed as a potential precursor to digitisation efforts by other major organisations such as Microsoft and Yahoo. </p>
	<p>In the USA government documents are in the public domain and so Government libraries contain vast quantities of free information waiting to be made accessible by those searching online. It is noted that commercial enterprises have been exploiting this data for some time and charging for access to it and that it is possible that Google&#8217;s project could impact that business model. </p>
	<p>Google&#8217;s Mission:</p>
	<blockquote><p>to organize the worlds information and make it universally accessible and useful.</blockquote>
 is compared to that of the Federal Depository Library Programme, which is essentially to do the same for US Government information.  </p>
	<p>A number of current hurdles to Government information being made easily made accessible to all online are noted:</p>
	<ul>
	<li>Internet access</li>
	<li>Document authentication</li>
	<li>Preservation of digital documents</li>
	</ul>
	<p>The fact that there are a number of solutions to the last two points, and the first is likely to be a transitory problem is not discussed. Making Government documents online is going to make them more accessible even to those without internet access, as they simply become more widely accessible throughout society. </p>
	<p>Farrell states in the article that:</p>
	<blockquote><p>I want to reach out to those who believe that the Internet has everything or that Google constitutes a thorough search.</p></blockquote>
	<p>Surprisingly the answer of making Government information available online, in a way such that it can be indexed effectively by search engines, indexes, and other information organising enterprises such as those involved in tagging, local searching etc. isn&#8217;t proposed. The article is suggesting that there will still be a primary role for individual library catalogues, Government Publication Office catalogues and others, and that the question facing librarians is not how to get people the information they&#8217;re searching for but: &#8220;How do we get them beyond Google into library resources?&#8221;, which is more like: &#8220;How do we keep librarians and libraries in the loop?&#8221;. Sci7 believes the kind of questions that could have been asked would have included how can we enable others to interact with our information and catalogues more easily - how do we provide access to catalogue data in machine readable formats.  </p>
	<p>There is no clear separation in the article between past, present and future. While the resources of  <a href="http://technorati.com/tag/google" rel="tag" target="_blank">Google</a> and others may be of value in digitising existing paper content, this is only applicable to the past. For the present and future ensuring that documents are made available in a timely fashion, indexed / tagged / described and linked appropriately, and made available to webcrawling robots is of key importance. Once the information is out there - it is down to the search engines and other session initiation sites to ensure that content is presented when it is an appropriate response to a user query.   </p>
	<p>While preservation of documents is mentioned, the article does not discuss document formats, it appears that this question might be too contentious to be discussed with respect to a Government machine with much information entrapped in closed proprietary Microsoft formats.  </p>
	<p>Lastly two questions are raised, apparently as threats: &#8220;How do we serve patrons whom we never see?&#8221; and  &#8220;How do we justify services and resources in an online environment?&#8221;, whereas Sci7 would counter that making information available online actually makes it possible to &#8220;see&#8221; a lot more about information users than ever before, the potential for detailed statistics and usage tracking makes informed judgements on questions such as value for money being provided by &#8220;library&#8221; services easier.  A key missing, but related question is one of privacy - should the Government be tracking everyone who requests a particular Government document - what should they be allowed to do with that information, for how long should it be stored, etc. The American Government&#8217;s power to access library records has been a major issue in the debate on the extension of the Patriot Act&#8217;s powers in this area. The potential for such invasion of privacy may well be extended by the increased digitisation of Government documents and any discussion of Government information being made available online is incomplete without considering this important area which could significantly affect how individuals choose to interact with Government information.  </p>
	<p>Prior to her position at Wyoming Farrell was librarian and &#8220;Internet Consultant&#8221; at the USA Government Printing Office and Head of Government Publications at the University of Nevada.</p>
	<p>Link to article: <a href="http://dx.doi.org/doi:10.1016/j.giq.2005.03.002" target="_blank">DOI: doi:10.1016/j.giq.2005.03.002</a></p>
	<p>Tags:<a href="http://technorati.com/tag/librarian" rel="tag" target="_blank">Librarian</a> | <a href="http://technorati.com/tag/libraries" rel="tag" target="_blank">Libraries</a> | <a href="http://technorati.com/tag/SEO" rel="tag" target="_blank">SEO</a> | <a href="http://technorati.com/tag/Sci7" rel="tag" target="_blank">Sci7</a></p>
]]></content:encoded>
			<wfw:commentRSS>http://www.sci7.com/cms/48/sci7-on-google-and-government-documents.html/feed/</wfw:commentRSS>
	</item>
		<item>
		<title>The influence of task and gender on search and evaluation behaviour using Google.</title>
		<link>http://www.sci7.com/cms/46/the-influence-of-task-and-gender-on-search-and-evaluation-behaviour-using-google.html</link>
		<comments>http://www.sci7.com/cms/46/the-influence-of-task-and-gender-on-search-and-evaluation-behaviour-using-google.html#comments</comments>
		<pubDate>Thu, 08 Dec 2005 18:17:12 +0000</pubDate>
		<dc:creator>Webmaster</dc:creator>
		
	<category>Resources</category>
		<guid>http://www.sci7.com/cms/46/the-influence-of-task-and-gender-on-search-and-evaluation-behaviour-using-google.html</guid>
		<description><![CDATA[	This is Sci7&#8217;s review of an article by Lori Lorigo, Bing Pan, Helene Hembrooke, Thorsten Joachims, Laura Granka, and Geri Gay to be published in the Journal &#8220;Information Processing and Management&#8221;, these comments and observations are made on a corrected proof &#8220;Article in Press&#8221;. The authors are from Cornell and Stanford Universities, and the research [...]]]></description>
			<content:encoded><![CDATA[	<p>This is <a href="http://www.sci7.com/">Sci7</a>&#8217;s review of an article by Lori Lorigo, Bing Pan, Helene Hembrooke, Thorsten Joachims, Laura Granka, and Geri Gay to be published in the Journal &#8220;Information Processing and Management&#8221;, these comments and observations are made on a corrected proof &#8220;Article in Press&#8221;. The authors are from Cornell and Stanford Universities, and the research was partially funded by Google. </p>
	<p>The researchers used eye tracking technology to assess how users interpret the information on Google&#8217;s search results page. Their results indicate that gender and task do affect the way in which users evaluate search results as revealed both by their eye movements and other actions.</p>
	<h2>Key general findings (as selected by Sci7)</h2>
	<ul>
	<li>Only in one fifth of cases were users observed to read the results in the order in which they are presented </li>
	<li>On average three of the results on a search engine results page were read </li>
	<li>No subjects went beyond the third results page.</li>
	<li>Most of the subjects viewed (not clicked) the first and second results quite equally.</li>
	<li>Subjects clicked on the first result most of the time.<br />
<em>Though the fraction of users clicking the first result was clearly recorded by the researchers, this interesting statistic did not make the paper</em>.</li>
	<li>On more than half of the search sessions, users chose to revise the query terms without clicking on any link.<br />
<em>This emphasises the importance of information, such as the first 90 or so characters of the title, META description and URL, which are often included on the search engine results page.</em>
</li>
	</ul>
	<p>This work utilises a simple and well established broad classification of the types of tasks that users may be trying to achieve when using a search engine, these classes are:</p>
	<ul>
	<li><strong>Navigational</strong> - The search engine is being used to help a user get to a specific page that the user already knows exists and is aiming for</li>
	<li><strong>Transactional</strong> - The search engine is used as a step in an online transaction eg. The user wants to make a purchase / reservation / send a message. </li>
	<li><strong>Informational</strong> - The search engine is used assist a user find information that they are looking for.</li>
	</ul>
	<p>The scale of this research, as with many studies on online behaviour appears tiny, this paper is based on the results derived from only 23 individuals, ~400 Google queries, and ~600 Google results pages.</p>
	<p>The researchers wrote navigational and informational tasks to be completed by their student test subjects. Navigational tasks were to get to a particular page eg. &#8220;Find the homepage of Michael Jordan, the statistician&#8221;, and informational tasks required the discovery online of the answers to factual questions such as: &#8220;Where is the tallest mountain in NY located?&#8221;.</p>
	<p>Two minutes were given to answer each query, eye tracking and transaction log data were recorded. Eye-tracking hardware and software was from <a href="http://www.a-s-l.com/" target="_blank">A.S.L</a> . Three different types of &#8220;looking&#8221; at ten regions of interest on the screen, corresponding to each of the natural search results on the results page were recorded. The visual interaction with each region being either a saccade (very brief look less than 50 ms), fixation (at least 200 ms on a region) , or a scanpath (a sequence of fixations) was recorded. Pupil dilation was also recorded as a measure of the interest a subject was taking in a particular page. The accuracy of the equipment did not enable determination of if the subject was reading the snippet, title, url or other element of each result. There was no mention in the paper of Google adverts, this is an odd omission. </p>
	<p>ANOVA and Chi squared analysis was used to determine if results obtained for the different gender and task classes were significantly different. These statistical measurements do not take into account the sample size relative to the number of individuals conducting web searches in the real world when assessing relevance - any significance found only refers to the small experimental sample.  While task success rates were not significantly different based on either task or gender, the manner in which the search engine results page was interpreted varied. Actions in which there was a significant (at the 0.01 level) difference between males and females included:</p>
	<ul>
<li>Females clicked on the second link twice as often as males. (Males clicked the second link 7% of the time compared to females&#8217; 14.5%.)</li>
	<li>Males viewed more results pages than females.</li>
	<li>Females repeatedly viewed the same result more often than men.</li>
	<li>Males were more likely to click on the lower ranking links (links 7,8,9,10) on a results page, these include those scrolling was required to reveal in the test environment, though males were more likely than females to follow link 6 too - though the difference wasn&#8217;t statistically significant -  suggesting &#8220;Males are more likely to click lower ranking links&#8221;, is a more defensible conclusion, than &#8220;Males are more likely to click links they have scrolled to reveal&#8221;.</li>
	</ul>
	<p>General additional observations included : &#8220;Males look at a greater number of abstracts and also spend more time examining the result page in general [than females]&#8221;. </p>
	<p>In terms of differences between tasks, no &#8220;transactional&#8221; tasks were evaluated, while Sci7 can understand that it may have been impractical (expensive) to get test subjects to make an online purchase for example, and many other transactional tasks require registration / signup etc.  the reasons for this omission should have been discussed in the paper. Such work could have been quite informative and could have looked at questions such as at what stage the different groups used search engines during the transaction. </p>
	<p>The results for the navigational tasks were as intuitively expected:</p>
	<blockquote><p>&#8220;Intuitively, a greater proportion of time is spent on Google result pages for navigational tasks since navigational questions do not require much additional scrutiny on web documents outside of Google; once the appropriate query result is found, one immediately has the answer for the navigational query.&#8221;</p></blockquote>
	<p>The statements in the above quote are backed up by the statistical data presented. </p>
	<p>Interestingly the paper currently under review refers to earlier work which estimated that in 2000, 1 in 28 of all web page views were of search engine results pages. Reference: Jansen, B. J., &#038; Pooch, U. (2000). <em>Web user studies: a review and framework for future work.</em> <strong>Journal of the American Society of Information Science and Technology,</strong> 52(3), 235246.</p>
	<p>The academic home page of the first author Lori Lorigo (It is normal academic practice that the first author is the one who has done the bulk of the work):<br />
<a href="http://www.hci.cornell.edu/people/lorigo.htm" target="_blank">http://www.hci.cornell.edu/people/lorigo.htm</a></p>
	<p>Sci7 has emailed Lori Lorigo inviting her to comment on this article, specifically on:</p>
	<ol>
	<li>The omission of any mention of Google Ads. Were users instructed not to click ads?. Where the ads present during the experiment?</li>
	<li>Did the Google funding come with any conditions? - Is this the reason ads were not mentioned / discussed?
</li>
	<li>Why transactional queries were not investigated.</li>
	<li>What is the fraction of users who clicked, the first link, the second link etc. ?</li>
	</ol>
	<p>DOI link to reviewed paper: <a href="http://dx.doi.org/10.1016/j.ipm.2005.10.001" target="_blank">10.1016/j.ipm.2005.10.001</a></p>
	<p>At the time of writing the author&#8217;s home and publications page are the only results on <a href="http://www.google.com/search?q=%22The+Influence+of+Task+and+Gender+on+Search+and+Evaluation%22" target="_blank">Google for a search for the article title</a>, suggesting Sci7 is the first to critically review or cite this work online:</p>
	<p><a href="http://www.sci7.com">Sci7&#8217;s web-optimization services</a> are informed by an awareness of online trends as well as published and proprietary research. </p>
	<p>Technorati Tags: <a href="http://technorati.com/tag/Google" rel="tag">Google</a> | <a href="http://technorati.com/tag/Google+Rank" rel="tag">Google+Rank</a> | <a href="http://technorati.com/tag/Usability" rel="tag">Usability</a> | <a href="http://technorati.com/tag/Search" rel="tag">Search</a>.
</p>
]]></content:encoded>
			<wfw:commentRSS>http://www.sci7.com/cms/46/the-influence-of-task-and-gender-on-search-and-evaluation-behaviour-using-google.html/feed/</wfw:commentRSS>
	</item>
		<item>
		<title>Frequency of occurrence of numbers in the World Wide Web</title>
		<link>http://www.sci7.com/cms/44/frequency-of-occurrence-of-numbers-in-the-world-wide-web.html</link>
		<comments>http://www.sci7.com/cms/44/frequency-of-occurrence-of-numbers-in-the-world-wide-web.html#comments</comments>
		<pubDate>Wed, 07 Dec 2005 22:36:54 +0000</pubDate>
		<dc:creator>Webmaster</dc:creator>
		
	<category>Resources</category>
		<guid>http://www.sci7.com/cms/44/frequency-of-occurrence-of-numbers-in-the-world-wide-web.html</guid>
		<description><![CDATA[	In July of 2005 S.N. Dorogovtseva, J.F.F. Mendesa and J.G. Oliveira published the results of an experiment in which they put every number from 0 to 100 000 into Google, noted the number of results obtained. The paper states this research was (amazingly to Sci7) partially supported by grant money. Citation: Physica A 360 (2006) [...]]]></description>
			<content:encoded><![CDATA[	<p>In July of 2005 S.N. Dorogovtseva, J.F.F. Mendesa and J.G. Oliveira published the results of an experiment in which they put every number from 0 to 100 000 into Google, noted the number of results obtained. The paper states this research was (amazingly to Sci7) partially supported by grant money. Citation: <em>Physica A 360 (2006) 548556</em></p>
	<p>The findings are largely unsurprising, with the popularity of numbers generally decreasing with size, powers of ten being more common than their neighbours, and various &#8220;special&#8221; numbers being particularly common,  groups of special were identified as:</p>
	<ul>
	<li>Powers of 10</li>
	<li>Multiples of 10 and 5</li>
	<li>
Easy to remember or symmetric numbers eg. 666 and 131313</li>
	<li>
Powers of 2</li>
	<li>
Numbers with strong associations eg. 666</li>
	<li>
Popular zip codes eg. 78701 </li>
	<li>
Toll free telephone number prefixes eg. 866, 877</li>
	<li>
Important historical dates eg. 1812 </li>
	<li>
Serial numbers of popular products 747, 8086 </li>
	<li>
Initial parts of mathematical constants 314159 </li>
	</ul>
	<p>The data was collected in the second week of December 2004, the number &#8220;2004&#8243; was present at a particularly high frequency (3,030,000,000 pages), with a rapid fall off in popularity of future years. </p>
	<p>Sci7 has at the begining of the second week of December 2005 obtained a page count for the years 1990-2015:</p>
	<table border=1>
	<tr>
<td>1990</td>
	<td>268,000,000</td>
</tr>
	<tr>
<td>1991</td>
	<td>213,000,000</td>
</tr>
	<tr>
<td>1992</td>
	<td>246,000,000</td>
</tr>
	<tr>
<td>1993</td>
	<td>239,000,000</td>
</tr>
	<tr>
<td>1994</td>
	<td>359,000,000</td>
</tr>
	<tr>
<td>1995</td>
	<td>588,000,000</td>
</tr>
	<tr>
<td>1996</td>
	<td>562,000,000</td>
</tr>
	<tr>
<td>1997</td>
	<td>566,000,000</td>
</tr>
	<tr>
<td>1998</td>
	<td>658,000,000</td>
</tr>
	<tr>
<td>1999</td>
	<td>795,000,000</td>
</tr>
	<tr>
<td>2000</td>
	<td>1,440,000,000</td>
</tr>
	<tr>
<td>2001</td>
	<td>1,250,000,000</td>
</tr>
	<tr>
<td>2002</td>
	<td>1,340,000,000</td>
</tr>
	<tr>
<td>2003</td>
	<td>1,610,000,000</td>
</tr>
	<tr>
<td>2004</td>
	<td>2,140,000,000</td>
</tr>
	<tr>
<td>2005</td>
	<td>6,680,000,000</td>
</tr>
	<tr>
<td>2006</td>
	<td>1,020,000,000</td>
</tr>
	<tr>
<td>2007</td>
	<td>157,000,000</td>
</tr>
	<tr>
<td>2008</td>
	<td>103,000,000</td>
</tr>
	<tr>
<td>2009</td>
	<td>52,600,000</td>
</tr>
	<tr>
<td>2010</td>
	<td>93,400,000</td>
</tr>
	<tr>
<td>2011</td>
	<td>24,500,000</td>
</tr>
	<tr>
<td>2012</td>
	<td>39,300,000</td>
</tr>
	<tr>
<td>2013</td>
	<td>16,200,000</td>
</tr>
	<tr>
<td>2014</td>
	<td>13,500,000</td>
</tr>
	<tr>
<td>2015</td>
	<td>29,100,000</td>
</tr>
	</table>
	<p>The top result for all the years to-date is the Wikipedia article for the year, the results for the years to come varies and includes the official london2012 site for 2012 and 2015.com. The popularity of the current year and fall off in future years is also seen in the above table. It is interesting to note that the current page count for &#8220;2004&#8243; in December 2005 is 0.7 of what is was in December 2004. This could be interpreted as suggesting the web, as reflected in Google&#8217;s index is being purged of outdated information, or could be a reflection of the number of sites which display the current year on them for various reasons. 1992 is slightly anomalous in that there are currently more pages on Google referring to it than 1993. </p>
	<p>The accuracy of Google&#8217;s number of pages returned count is incredibly not discussed, and neither is Google&#8217;s progress as of 2004 towards its stated aim of making the entirety of the world&#8217;s information searchable, and the bias of the current incomplete index of all human knowledge which Google holds. </p>
	<p>Sci7 is able to produce datasets such as those used for the research discussed here (and those with much greater complexity) from a wide variety of sources. </p>
	<p>A free full text PDF of the original article is available:<br />
<a href="http://arxiv.org/pdf/physics/0504185" target="_blank">http://arxiv.org/pdf/physics/0504185</a></p>
]]></content:encoded>
			<wfw:commentRSS>http://www.sci7.com/cms/44/frequency-of-occurrence-of-numbers-in-the-world-wide-web.html/feed/</wfw:commentRSS>
	</item>
		<item>
		<title>Multitasking during Web search sessions</title>
		<link>http://www.sci7.com/cms/43/multitasking-during-web-search-sessions.html</link>
		<comments>http://www.sci7.com/cms/43/multitasking-during-web-search-sessions.html#comments</comments>
		<pubDate>Wed, 07 Dec 2005 20:29:49 +0000</pubDate>
		<dc:creator>Webmaster</dc:creator>
		
	<category>Resources</category>
		<guid>http://www.sci7.com/cms/43/multitasking-during-web-search-sessions.html</guid>
		<description><![CDATA[	This is Sci7 Ltd&#8217;s review of an article by Amanda Spink, Minsoo Park, Bernard J. Jansen and Jan Pedersen published in volume 42 of the journal &#8220;Information Processing and Management&#8221; (pages 264275) with an expected publication date of early 2006. 
	Data from Altavista
	The research discussed is based on analysis of users of the Altavista search [...]]]></description>
			<content:encoded><![CDATA[	<p>This is Sci7 Ltd&#8217;s review of an article by Amanda Spink, Minsoo Park, Bernard J. Jansen and Jan Pedersen published in volume 42 of the journal &#8220;Information Processing and Management&#8221; (pages 264275) with an expected publication date of early 2006. </p>
	<h2>Data from Altavista</h2>
	<p>The research discussed is based on analysis of users of the Altavista search engine, the paper states that data was obtained from Altavista &#8220;transaction logs&#8221; from 2002. The fact that this data is not recent is one weakness of the research conducted, though the fact that at the time Altavista had a larger fraction of search users than it can now boast is given to support the use of this dataset. The logs are for a 24h period, taken on a Sunday, it is possible that weekend search queries are different to weekday ones, this isn&#8217;t an issue addressed by this research. Interestingly the analysis was affected by the presence of automated searches that were being submitted to Altavista. Sci7 believes the method used for removing such &#8216;robot&#8217; derived hits was dubious in that all sessions of more than one hundred queries were removed. Sci7&#8217;s believes  smarter techniques for identifying robots such as looking for very short intervals between queries, and using &#8220;User Agent&#8221; may have been more appriopriate. The apparent assumption that users have unique IP addresses could well have resulted in significant chunks of the user base such as those in institutions using web-caching, and those using certain ISPs being excluded from the dataset by being identified as robots by the techniques applied. </p>
	<h2>Conflict of Interest?</h2>
	<p>It is perhaps likely that Altavista would make such out dated information available for academic research, however the fact one of the authors, Pendersen  works for Yahoo which provides the search function on Altavista, is presumably the situation which enabled the data to be obtained for the study. The potential for a conflict of interest should be considered when considering the results of the paper discussed here. </p>
	<h2>User Behaviour</h2>
	<p>The research focuses on how users interact with search engines, looking at areas such as multitasking - what else users are doing while they&#8217;re involved in a search session, and the type of queries that they are using.</p>
	<p>Key findings presented included:</p>
	<ul>
<li>Sessions ranged in duration from less than a minute to a few hours.</li>
	<li>In only 19% of two-query sessions were both queries on the same topic.</li>
	<li>91.3% of three or more query sessions included multiple topics.</li>
	<li>There are a broad variety of topics in multitasking search sessions.</li>
</ul>
	<p>A session is defined as : &#8220;&#8230;the entire series of queries submitted by a user during one interaction with the Web search engine. Session length varied from less than a minute to a few hours.&#8221;. Sci7 would have been given more confidence by a rigourous definition based on the method used to identify queries from the same session from the raw transaction logs. </p>
	<p>A key question is if the multiple search terms are devised to try and find the same information for the user, the suggestion is made - due to other referenced research that the multiple topics are due to users seeking a range of different information within the session. This referenced work includes studies carried out on data from other search engines including Excite and Alltheweb. These sources suggest that an average &#8220;multitasking&#8221; search session involves two topics.  This is a bit of an odd statistic as if a single query session is the most common, two queries the next most common and further multiples of queries even less common it is clear that the most popular &#8220;multitasking&#8221; query is one with two queries. </p>
	<p>An sample of &#8220;multitasking&#8217; is given in the paper:</p>
	<blockquote><p>In Example 1, the user ranged over more than three topics during a period of 11 minutes from crystal meth to ephedrine then catwoman and finally plastic surgery.</p></blockquote>
	<h2>Sample Size</h2>
	<p>A random set of sessions were taken from the dataset obtained from altavista, and split depending on the number of queries per session. The sets then obtained were tiny - only 254 two query sessions, 206 of which were determined to be &#8220;multitasking&#8221; for example. 254 sessions is in the view of Sci7 far too small a dataset from which to draw conclusions of any sort when compared to the 250 million searches per day  in 2003 on Google and partners reported by Danny Sullivan of Search Engine Watch (Searchenginewatch.com), citing Google itsself as the source of that number.  </p>
	<h2>Importance</h2>
	<p>An apparent important omission from this paper appears to be the relative number of single query sessions, without this information it is difficult to judge the importance of the research presented here on multi query sessions. The only evidence given related to this crucial question is derived from an Excite user survey which is a rather different approach to data collection than log analysis. The Excite survey revealed 96.2% of sessions were single session queries, bringing the impact of research, such as that being reviewed here focused on the remaining 3.8% (11 out of 287) into question.</p>
	<h2>Author&#8217;s Recommentations</h2>
	<p>A number of recommendations are made, these are commented on below:</p>
	<ul>
	<li>Provide users with the ability to access, refine, and use results from a previous searches within the confines of a session across multiple topics.<br />
<em>What comes out from the research is in fact that where this is done it should be done with care as the previous results within the session are unlikely to be related. </em>
</li>
	<li>
Assist users in coordinating multiple topics into effective queries (i.e., search histories, various thesauri or keyword generation tools).<br />
<em>There are two options here, one is to bring multiple topics together, the other is to keep them separate, and to ensure that sequential searches are generally treated independently, or at least the opportunity to treat them independently is maintained. </em>
</li>
	<li>
Provide searchers the ability to create multiple sets of working notes related to different or related search topics (i.e., sketching, hyper-linking, and note creation tools).<em>To allow multiple distinct sets of any such notes is I think the key point here, and one not well made by the recommendations.</em></li>
	<li>
Enable the submission and tracking of multiple queries concurrently on different or related topics. <em>There is a basis for this, but research on fraction of sessions which involve multiple searches should be reviewed in order to determine the prominence such features should be given. </em></li>
	<li>
Allow for searching multiple search engines or collections concurrently on multiple topics.<em>No data on users using multiple search engines or collections concurrently is introduced within this paper</em></li>
	<li>
Enable the reformulation of multiple queries on different or related topics.<em>What search engine doesn&#8217;t provide this functionality?</em></li>
	<li>
Provide windowing facilities to allow Web users to generate and track separate topic or related topic queries and facilitate topic switching.<em>Again, this appears a sensible suggestion, but the evidence provided suggests that this will only be required by the minority of users engaged in multi-query search sessions, so should perhaps not be present in a &#8220;default&#8221; or &#8220;simple&#8221; search interface</em></li>
	<li>
Enable the generation and comparison of relevance judgments from different or related searches.<em>Allowing for either different or related searches appears sensible</em></li>
	<li>
Enable the tracking, storing and manipulating retrieved results and printouts related to different topics over multiple searches.<em>Printouts?!, no evidence is presented that users print their search results, or that people store search results, this recommendation appears well beyond the scope of of the research either conducted or referenced </em></li>
	<li>
Provide the ability to create clusters of retrieved information related to different topics.<em>This is absolutely indicated, but really only becomes important when the search interface is such that searches are &#8220;remembered&#8221; and past searches influence future results. If retrieved information is to be stored, then it should be clustered into various different topics. </em></li>
	</ul>
	<h2>Sci7&#8217;s Summery</h2>
	<p>While much of the above suggestions appear sensible, many do not appear to directly follow from the research results described in the paper, and are therefore simply opinions and suggestions, the fact they appear in a &#8220;research paper&#8221; should not give them undue credibility. </p>
	<p>The key finding of this research can be summarised as:<br />
<strong>When users enter multiple queries within a session they are highly likely (81.1% if two queries, 91.3% if three) to be unrelated queries not on the same topic.</strong></p>
	<p>In Sci7&#8217;s view the key recommendation coming out of this work is that simply that this pattern of behaviour should be recognised. User interfaces of search engines should not make the apparently unfounded assumption that sequential queries in the same session will be on the same topic. In practice this means that options such as &#8220;search within results&#8221;, and &#8220;conduct this same search on another database&#8221; should be given lesser prominence than options to simply start a new search from scratch. </p>
	<p>Sci7&#8217;s advice and recommendations are based on a scientific, evidence based approach to the optimisation of online services, derived from both propriety and published research. </p>
]]></content:encoded>
			<wfw:commentRSS>http://www.sci7.com/cms/43/multitasking-during-web-search-sessions.html/feed/</wfw:commentRSS>
	</item>
		<item>
		<title>Sci7 on The Anatomy of a Large-Scale Hypertextual Web Search Engine</title>
		<link>http://www.sci7.com/cms/42/sci7-on-the-anatomy-of-a-large-scale-hypertextual-web-search-engine.html</link>
		<comments>http://www.sci7.com/cms/42/sci7-on-the-anatomy-of-a-large-scale-hypertextual-web-search-engine.html#comments</comments>
		<pubDate>Tue, 06 Dec 2005 19:15:09 +0000</pubDate>
		<dc:creator>Webmaster</dc:creator>
		
	<category>Resources</category>
		<guid>http://www.sci7.com/cms/42/sci7-on-the-anatomy-of-a-large-scale-hypertextual-web-search-engine.html</guid>
		<description><![CDATA[	Sci7&#8217;s website optimisation services are informed by a scientific approach, which includes an awareness of both current and historical trends.
	Google
	Google&#8217;s emergence into the search engine scene was an important occurance in the history of search on the web. The founders of Google, Sergey Brin and Lawrence Page wrote a paper describing the first incarnation of [...]]]></description>
			<content:encoded><![CDATA[	<p>Sci7&#8217;s website optimisation services are informed by a scientific approach, which includes an awareness of both current and historical trends.</p>
	<h2>Google</h2>
	<p>Google&#8217;s emergence into the search engine scene was an important occurance in the history of search on the web. The founders of Google, Sergey Brin and Lawrence Page wrote a paper describing the first incarnation of their search engine when the internet application was a prototype located at Stanford University, here this paper is discussed by Sci7 Ltd. </p>
	<h2>Session Initiation</h2>
	<p>In this paper they noted that people were likely to surf the web, following links from one page to another, starting their journey at either a directory or search engine. Search engines have since become the most popular starting point for a &#8220;web session&#8221;. Other starting points have emerged such as RSS newsreaders which alert users to fresh content on sites which they are interested in, there are also a variety of &#8220;push&#8221; technologies which have a similar effect on the initiation of the browsing experience as RSS. Institutional intranet pages, and email clients (web based or not) are also current significant starting points, though many of these will themselves be portals including feeds (content supplied by push, or push-like technology)&#8221;, search boxes, and directories. Novel manners in which surfing sessions are emerging, in many European cities when you enter your phone will receive a text advertising what&#8217;s available locally, these message can include web-links. When wireless users connect to networks they are often first presented with a splash registration page from the network provider. Future initiation events might include brining a certain product into proximity of a computer, the product could be detected by its internal RFID tags, or by image recognition via a webcam. </p>
	<h2>PageRank</h2>
	<p>Brin and Page&#8217;s paper introduced the basis of PageRank, where information present in terms of which pages and sites link to each other is extracted and interpreted. It is worth noting that this paper was not the first to describe using what the paper describes as &#8221; &#8220;, for determining the relative relevance of pages. Three groups of people who had previously described such techniques are cited:</p>
	<ul>
<li>Ellen Spertus. ParaSite: Mining Structural Information on the Web. The Sixth International WWW Conference (WWW 97). Santa Clara, USA, April 7-11, 1997.</li>
	<li>Massimo Marchiori. The Quest for Correct Information on the Web: Hyper Search Engines. The Sixth International WWW Conference (WWW 97). Santa Clara, USA, April 7-11, 1997.</li>
	<li>Jon Kleinberg, Authoritative Sources in a Hyperlinked Environment, Proc. ACM-SIAM Symposium on Discrete Algorithms, 1998.</li>
	<li>Ron Weiss, Bienvenido Velez, Mark A. Sheldon, Chanathip Manprempre, Peter Szilagyi, Andrzej Duda, and David K. Gifford. HyPursuit: A Hierarchical Network Search Engine that Exploits Content-Link Hypertext Clustering. Proceedings of the 7th ACM Conference on Hypertext. New York, 1996.</li>
</ul>
	<p>There is also a reference in the paper to the way in which the relevance and importance of academic papers has been assessed by the number of references to them in other academic papers. In the academic world this approach is known to have a number major flaws, one is that a particular type of paper tends to get a disproportionate number of citations - that&#8217;s one describing how to do a particular type of experiment for the first time, that&#8217;s because from then on all scientists using that technique cite the original paper. Google&#8217;s PageRank takes into account both the number of links coming to a page, and the PageRank of the pages on which those inbound links were found. </p>
	<h2>Link Text</h2>
	<p>The second technique, used by the prototype Google in addition to PageRank, is analysis of the link text, the text enclosed by the anchor tags. The paper describes the association of such link text with both the page that it is on, and the page it links to. This makes sense as link text is usually &#8220;important&#8221;, in that it is highlighted, like titles, URLs, capitalised text, headings, and other specially tagged text, the use of such indicators is also mentioned in the paper. The Google founders also note that the link text is likely to offer a better description of the target page content than can be obtained from the target page, the fact that content such as images which contain no inherent descriptive textual descriptive content can be associated with the link text of inbound links. </p>
	<h2>Updates</h2>
	<p>The reason behind the three month update cycle at Google, and one purpose of the &#8220;see more results&#8221; link is revealed by the article, one is that indexes need to be rebuilt and this takes time, and another is that a first pass of the results takes place on pre-computed collections of data, and only if that is insufficient is the whole dataset queried. </p>
	<h2>Google Local</h2>
	<p>Even this first incarnation of Google is revealed to make use of proximity in ordering the search results, though it is almost certain that at this stage this was at the country level, rather than the city / street level that is emerging with Google local and other services. </p>
	<p>The prototype Google described had a databank of 24 million web pages 147GB, which appears decidedly tiny compared to current data storage requirements for even highly specialised crawling options. </p>
	<h2>Feedback</h2>
	<p>The fact that the amount of traffic a page receives should ideally be taken into account in producing the search results is discussed. At the time though Google did not have access to such information, as of 2005 Google tracks which sites people click to from the Google results (and tracks Google Toolbar users to a greater degree). The tracking information can be used to infer the traffic to various sites, as well as provide a way of determining the relevance of search results. This feedback loop, where commonly clicked links are promoted is known to be used for advert positioning by Google - its use in the search results has not yet been confirmed, though Sci7&#8217;s internal (ongoing) research reveals indications that it is occurring. </p>
	<p>Some key points about Google&#8217;s data management revealed:</p>
	<ul>
<li>Whenever Google comes across a new URL in a page a unique ID number is assigned to that URL.</li>
	<li>
</li>
	<li>Crawled documents are stored in a compressed form (zlib) on Google&#8217;s servers.</li>
	<li>Each document is split into chunks, these are ranked depending how far into the document they occur, and how they are tagged.</li>
	<li>These chunks are then grouped.</li>
	<li>Links are extracted from the crawled document and placed in another file(index), along with the link text, source and destination. </li>
	<li>Relative URLs are corrected into absolute URLs.</li>
	<li>A database of pairs of ID numbers (linked pages) is created, this is an element of the raw data on from which PageRank is calculated.</li>
	</ul>
	<p>Metadata is described as a poor source of information for determining relevance as poor, as information not visible to the user is often used intentionally to attempt to manipulate search results. </p>
	<h2>Challenges for a Search Engine</h2>
	<p>The paper gives much insight into what the challenges faced by a search engine are: &#8220;One of the main causes of this problem is that the number of documents in the indices has been increasing by many orders of magnitude, but the user&#8217;s ability to look at documents has not. People are still only willing to look at the first few tens of results.&#8221; This problem can be viewed from the point of view of an organisation or individual running a website - if you&#8217;re not in the top thirty or so results on the popular search engines for relevant search terms then you won&#8217;t get much traffic from them. This problem might be lessened with localisation technology, many searches are naturally localised, localisation can reduce the number of documents from which your search results are to be selected. </p>
	<h2>Googlebot</h2>
	<p>The Google web crawler, now known as the Googlebot is discussed. </p>
	<ul>
<li>It is made of a number of machines, URL servers, and URL grabbers</li>
	<li>The machines which get the data hold DNS caches</li>
	<li>The code for this element of the operation is written in Python</li>
	</ul>
	<p>Crawling problems are also discussed, in that crawling an online game resulted in problems as the Googlebot presumably behaved like a crazed player following allsorts of links. Sci7 has been made aware of a re-emergence of this kind of problem when companies run the Google search appliance within their company networks, or from IP addresses which are logged into online applications. </p>
	<p>Even this early prototype reveals the use of &#8220;trusted feedback&#8221;, in the this case just for evaluation of the results, presumably for improving the algorithm. However Sci7 now believes Google uses a large number of individuals to manually check results, primarily for the purpose of removing search engine spam from the system, Sci7 customers get the benefit from our knowing the criteria used by these individuals to assess sites. </p>
	<h2>Google Rank</h2>
	<p>The paper uses as an example throughout the name of the then current US president, highlighting the fact the top result as whitehouse.gov as evidence that the search engine is working well. While this is the case with the current president the other results are interesting with bushorchimp.com coming in at number four, and much has been made of &#8220;google bombs&#8221; such as:<br />
http://www.google.com/search?q=failure<br />
These effects are created by many people online creating links with to the Whitehouse official biography of George W Bush using the word <a href="http://www.whitehouse.gov/president/gwbbio.html">failure</a> as the link text as has been done in this sentence. </p>
	<h2>Forward Looking</h2>
	<p>Much is also made in this early paper about the scalability of the techniques used, steps such as designing their own operating system capable of dealing with very large files, and use in a failure tolerant distributive environment. Thinking about the future is clearly important for all organisations involved in online technologies where the future has a tendency to arrive very quickly. </p>
	<h2>Sci7</h2>
	<p>Sci7 provides website optimisation consultancy, tools, and software, among its portfolio of services and information products details on the <a href="http://www.sci7.com>Sci7 Homepage</a>.</p>
]]></content:encoded>
			<wfw:commentRSS>http://www.sci7.com/cms/42/sci7-on-the-anatomy-of-a-large-scale-hypertextual-web-search-engine.html/feed/</wfw:commentRSS>
	</item>
		<item>
		<title>Google Base</title>
		<link>http://www.sci7.com/cms/41/google-base.html</link>
		<comments>http://www.sci7.com/cms/41/google-base.html#comments</comments>
		<pubDate>Wed, 16 Nov 2005 15:13:43 +0000</pubDate>
		<dc:creator>Webmaster</dc:creator>
		
	<category>Resources</category>
		<guid>http://www.sci7.com/cms/41/google-base.html</guid>
		<description><![CDATA[	Google have recently launched Google Base - http://www.google.com/base/, a system which gives content providers the ability to upload content directly to a database hosted by Google. Many types of content can be submitted to the database, ideally in a format which makes it easy for Google&#8217;s search methods to supply the data in search results [...]]]></description>
			<content:encoded><![CDATA[	<p>Google have recently launched Google Base - <a href="http://www.google.com/base/">http://www.google.com/base/</a>, a system which gives content providers the ability to upload content directly to a database hosted by Google. Many types of content can be submitted to the database, ideally in a format which makes it easy for Google&#8217;s search methods to supply the data in search results when it is relevant to do so. It is possible to tag and describe the information that you are uploading so that it&#8217;s relevance and connection with other data can be &#8220;understood&#8221; by Google&#8217;s computers.  </p>
	<p>This can be best seen with an example, Google suggests that information on jobs available is submitted to their database with the information contained within descriptive tags such as:</p>
	<p><code>&lt;g :job_industry&gt; &lt;/g&gt;<br />
&lt;g :employer&gt; &lt;/g&gt;<br />
&lt;g :job_function&gt; &lt;/g&gt;<br />
&lt;g :job_type&gt; &lt;/g&gt;<br />
&lt;g :salary&gt; &lt;/g&gt;<br />
&lt;g :salary_type&gt; &lt;/g&gt;<br />
&lt;g :education&gt; &lt;/g&gt;<br />
&lt;g :immigration_status&gt; &lt;/g&gt;<br />
&lt;g :location&gt; &lt;/g&gt;</code></p>
	<p>There are also formats for scientific research data (which if used may well result in relevant information being available via a Google Scholar search), as well as formats for describing products and services and a vast amount of other types of data, such as addresses which can be searched by Google local. The scheme is expandable with the opportunity to add new descriptive tags for different types of content. Products added to the Google database may, if relevant, appear in Froogle search results. </p>
	<p>Sci7 is able to offer a number of services to enable organisations to ensure that they are supplying their content to Google, and the other search engines and information organising sites on the web. In terms of the Google database we offer the automated translation of existing web/database content into formats optimized for submission to the Google database, we will also manage regular automated updates of your organization&#8217;s Google database content. </p>
	<p>These services are available in a variety of formats depending on your organization&#8217;s needs:</p>
	<ul>
<li>As part of a broader website / search engine optimization package.</li>
	<li>As custom designed modifications / add-ons to your existing IT systems</li>
	<li>As a stand alone service where Sci7 automatically obtains relevant information from your website, refactors and repackages and submits it to the Google database at an appropriate frequency.</li>
	</ul>
	<h3>Frequently asked client questions:</h3>
	<p><strong>We are already supplying product information to Froogle, how does this affect me?</strong><br />
<em>There is now an opportunity to improve the machine readability of your product feed, and to include a broader range of information on your company and its products and services. </em></p>
	<p><strong>My organization&#8217;s RSS feeds are already automatically picked up by Google and appear in various search results, do I need to submit them to the Google database?</strong><br />
<em>The Google database now provides you with more control over how and when your information reaches Google. If your feed is not being consumed by Google regularly enough, the new services enable you to push your feed to Google. RSS feeds are an extensible format - new descriptive elements can be added within them, with the public launch of the Google database many such tags, which Google &#8220;understands&#8221; have been released, Sci7 is able to optimise your feeds to include information presented in the required format.</em></p>
	<p><strong>Why?</strong><br />
<em>To make sure links to your site / content appear when relevant search terms are entered into Google, particularly its specialised search tools such as Google Local, Google Scholar, Froogle etc. </em></p>
	<p><strong>Who?</strong><br />
<em>Everyone! Google&#8217;s database can host information about products and services which are not available on the web. Sci7 is able to assist and advise the full range of organisations who can benefit from optimized management of the data held on them in the Google database, from small companies with no or little IT experience to large technologically savvy organisations. </em></p>
	<p><a href="http://technorati.com/tag/googlebase" rel="tag">Googlebase</a> | <a href="http://technorati.com/tag/goog" rel="tag">GOOG</a> | <a href="http://technorati.com/tag/seo" rel="tag">SEO</a> | <a href="http://technorati.com/tag/google" rel="tag">Google</a></p>
]]></content:encoded>
			<wfw:commentRSS>http://www.sci7.com/cms/41/google-base.html/feed/</wfw:commentRSS>
	</item>
		<item>
		<title>Web 2.0</title>
		<link>http://www.sci7.com/cms/47/web-20.html</link>
		<comments>http://www.sci7.com/cms/47/web-20.html#comments</comments>
		<pubDate>Tue, 01 Nov 2005 01:24:50 +0000</pubDate>
		<dc:creator>Webmaster</dc:creator>
		
	<category>Resources</category>
		<guid>http://www.sci7.com/cms/47/web-20.html</guid>
		<description><![CDATA[	There is currently a wealth of discussion which uses the terms Web 1.0, Web 2.0, and more rarely Web 3.0. Wherever these terms are used, and whatever technologies are placed into the particular categories what is typically found is an analysis of how online (it is not limited to the &#8220;web&#8221;) technologies and behaviours have [...]]]></description>
			<content:encoded><![CDATA[	<p>There is currently a wealth of discussion which uses the terms Web 1.0, Web 2.0, and more rarely Web 3.0. Wherever these terms are used, and whatever technologies are placed into the particular categories what is typically found is an analysis of how online (it is not limited to the &#8220;web&#8221;) technologies and behaviours have developed. Often there will be a discussion of where we are now and typically an attempt to determine future trends. That&#8217;s all there is, discussion, and some usually fairly vague attempt at classification, these terms do not yet have any broadly accepted definitions, though there is some emerging consensus, and they certainly don&#8217;t describe any new standards or new browsers. </p>
	<p>Typically &#8220;Web 2.0&#8243; is considered to be made up of websites and applications which have certain features shared by a number of currently successful sites. In a September 2005 article by Tim O&#8217;Reilly entitled <a href="http://www.oreillynet.com/pub/a/oreilly/tim/news/2005/09/30/what-is-web-20.html?page=1" target="_blank">What is Web 2.0</a> attempted to define some of the characteristics of Web 2.0 technologies. He identified sites such as Flickr and Wikipedia were part of &#8220;Web 2.0&#8243;, and compared them to their Web 1.0 &#8220;predecessors&#8221; <em>Ofoto</em> and <em>Britannica Online</em> respectively. Looking at those examples it is possible to see how there are common features such as:</p>
	<ul>
<li>User generated / &#8220;owned&#8221; content. </li>
	<li>Easy automated interaction between sites via APIs, Feeds, RPIs etc.</li>
	<li>Community / Participation.</li>
	<li>Standards compliance.</li>
	<li>Separation of content from presentation.</li>
	<li>Some rights reserved licensing.</li>
	</ul>
	<p>A common question is &#8220;How can we join in?&#8221;. And that is something that Sci7 has been helping companies with for some years now. Enabling users to add comments to sites, producing not only your latest articles in RSS (or other) feeds, but also your comments and perhaps search results via OpenSearch. Do you make the raw content product catalogue available to tech savvy customers and modern search engines in XML format so they can use it in the way that they want or do you force them through your own interface? </p>
	<p>The barrier to getting involved in &#8220;Web 2.0&#8243; is small, even adding comments and trackbacks to articles on a corporate website in a manner that &#8220;Bloggers&#8221; have been using for many years now is making an nod in the right direction. The very nature of Web 2.0 applications, enables small entities to rapidly place themselves right in the middle of things by providing a useful service eg. Pingomatic.com, or simply brining a collection of &#8220;Web 2.0&#8243; applications together to create a new one greater than the sum of its parts.  </p>
	<h2>Web 3.0</h2>
	<p>Often Web 2.0 discussions involve AJAX (explained below), and some AJAX applications are firmly in the Web 2.0 arena, such as those enabling live updates of RSS feeds or search results as new information becomes available. However what we see AJAX being used for is altogether more interesting, and might well be giving us a glimpse of what Web 3.0 will be about:  <strong>Instantly responsive online applications</strong>, or perhaps more accurately: <strong>Apparently instantly responsive remote interactive applications. </strong><br />
<a href="http://www.google.com/webhp?complete=1&#038;hl=en" target="_blank">Google Suggest is an example of an application where a remote application gives local PC like levels of interactivity.</a></p>
	<p>Moving towards the point where online services become the norm, and the ultimate thin client that is the web-browser will become ever more capable of providing more users with access to the tools that they require. </p>
	<p>Google Earth and Google Suggest are perhaps transitional applications which are giving us the first glimpse of what web 3.0 might offer. While true web applications function through a browser, but there are some emerging application which are way ahead of the curve, such as  Google Earth and Skype which while closely integrated with the web, currently require dedicated client side applications to be run. </p>
	<h2>What is AJAX</h2>
	<p>AJAX, a term describing the use of <b>A</b>synchronous <b>Ja</b>vaScript and <b>X</b>ML, was coined by Jesse Garret in February of 2005, in an <a href="http://www.adaptivepath.com/publications/essays/archives/000385.php" target="_blank">online essay entitled &#8220;AJAX: A New Approach to Web Applications&#8221;</a>. Since then the strict definition proposed by Garrett has been relaxed in common usage to cover online applications which typically have the following key characteristics:</p>
	<ul>
	<li>Personal computer like levels of interactivity with remote applications</li>
	<li>Standards compliant presentation</li>
	<li>Dynamic display and interaction using the <a href="http://www.w3.org/DOM/" target="_blank">Document Object Model</a></li>
	<li>Data interchange and manipulation using open structured data interchange formats</li>
	<li>Asynchronous data retrieval using XMLHttpRequest</li>
	<li>Client side scripting binding everything together</li>
	</ul>
	<h2>Tracking</h2>
	<p>Web 2.0 requires smarter interpretation of statistics, polling AJAX applications, RSS feed readers and other services will result in various new types of traffic on your webserver. As these routes of getting your site / brand / message out become more important it may be necessary to revisit some of the more traditional ways of reporting traffic on websites, as more and more ways are provided for users to interact with data provided by an organisation producing meaningful web-stats becomes more of a challenge. Your webserver could be hit once by a site syndicating your content which could then cascade it on to many of its own users, perhaps along with your own advertising - though Google&#8217;s patent application <em>Embedding advertisements in syndicated content</em> if awarded and upheld could dampen uptake of such approaches (or at least limit you to using Google Ads in conjunction with them).  </p>
	<p><a href="http://www.sci7.com/">Sci7 has expertise in Business Blogging, OpenSearch, RSS and many other technologies that underlie the cutting edge of the web today.</a></p>
	<p>Technorati Tags: <a href="http://technorati.com/tag/web+2.0" rel="tag">Web+2.0</a> | <a href="http://technorati.com/tag/web2.0" rel="tag">Web2.0</a> | <a href="http://technorati.com/tag/AJAX" rel="tag">AJAX</a> | <a href="http://technorati.com/tag/web+3.0" rel="tag">Web+3.0</a>.
</p>
]]></content:encoded>
			<wfw:commentRSS>http://www.sci7.com/cms/47/web-20.html/feed/</wfw:commentRSS>
	</item>
		<item>
		<title>Free Biomedical Images</title>
		<link>http://www.sci7.com/cms/40/free-biomedical-images.html</link>
		<comments>http://www.sci7.com/cms/40/free-biomedical-images.html#comments</comments>
		<pubDate>Sat, 24 Sep 2005 22:56:14 +0000</pubDate>
		<dc:creator>Webmaster</dc:creator>
		
	<category>News</category>
		<guid>http://www.sci7.com/cms/40/free-biomedical-images.html</guid>
		<description><![CDATA[	Sci7 has launched a new web resource making over 22000 life science images easily accessible. The images are made available under open licences which means they available for use by all at no cost.
	http://www.biomedimages.com/
	Sci7 provides advice on sourcing appropriately licensed life-science images and other content for use in a range of media. 
]]></description>
			<content:encoded><![CDATA[	<p>Sci7 has launched a new web resource making over 22000 life science images easily accessible. The images are made available under open licences which means they available for use by all at no cost.</p>
	<p><a href="http://www.biomedimages.com/" target="_blank" title="Free Biomedical Images from Sci7 Ltd.">http://www.biomedimages.com/</a></p>
	<p>Sci7 provides advice on sourcing appropriately licensed life-science images and other content for use in a range of media. </p>
]]></content:encoded>
			<wfw:commentRSS>http://www.sci7.com/cms/40/free-biomedical-images.html/feed/</wfw:commentRSS>
	</item>
	</channel>
</rss>

