2010-11-10

What is your vision on Web 3.0? What are the key technologies that will make it happen?



The term Web 3.0 is a very hot topic nowadays. However, as popular as it is, its definition is still not confirmed yet thanks to the many different meanings given to it by several experts which makes the definition of Web 3.0 one of the hottest debates in the history of the World Wide Web. According to the Wikipedia, an online encyclopedia, Web 3.0 is a third generation of Internet based Web services which emphasizes on machine-facilitated understanding of information in order to provide a more productive and intuitive user experience. The term "machine-facilitated" is very important as it summaries what Web 3.0 is all about. Unlike Web 2.0 which is about people communicating, contributing and collaborating and the results coming from the wisdom of crowds, Web 3.0 derives its "wisdom" from software that learns by searching, analyzing and drawing conclusions from online contents. Instead of depending on people to refine information and opinion, intelligent software would do the same thing. This transformation of the Web into an artificial intelligence technology, the Semantic web, is in my opinion what defines Web 3.0.

In the perspective of Semantic Web, Web 3.0 is an evolving extension of the World Wide Web in which web content can be expressed not only in its natural language but also in a form that can be understood, interpreted and used by software applications and thus permitting them to find, share and integrate information more easily. The Semantic Web was first thought up by Tim Berners-Lee, the inventor of first World Wide Web, and expressed as follows:

"I have a dream for the Web [in which computers] become capable of analyzing all the data on the Web – the content, links, and transactions between people and computers. A 'Semantic Web', which should make this possible, has yet to emerge, but when it does, the day-to-day mechanisms of trade, bureaucracy and our daily lives will be handled by machines talking to machines. The 'intelligent agents' people have touted for ages will finally materialize."
– Tim Berners-Lee, 1999

What is my rationale behind for defining Web 3.0 as Semantic Web? It is that I see the Semantic Web as a huge engineering solution for generalizing the format of the data on the Web in the form that can be easily processed by anyone and any system which makes the idea of Web 3.0 to be possible. The problem now for Web 2.0 is that majority of data published on the Web is generally hidden away in HTML files as the format of HTML is not as standardized as one may want to which makes it difficult for other systems and software applications to extract the data from the HTML web page. For example, if a HTML web page that is used to provide weather information to other software applications has changed its design and style, then those software applications have to change their methods of extracting information from the web page too to adapt to the new format of the HTML files. This is not workable as one may not know when the other web pages will change their HTML design and style. As a result, more and more web technologies such as RDF, OWL, SWRL and SPARQL have been developed with the aim of generalizing the format of the data on the Web in order to make the data understandable by other systems and software applications and thus evolve the World Wide Web into a new generation of Web 3.0 and Semantic Web. In this article below, I will try to describe some of the Semantic web technologies that will make this evolution into Web 3.0 possible.

The Semantic Web is generally built on syntaxes which utilize the triple based structures of the Uniform Resource Identifier (URI) to represent data. These syntaxes are called "Resource Description Framework" (RDF) syntaxes. RDF is designed to be read and understood by computers and not for being displayed to people. It is usually written in XML and makes use of XML syntax such as namespaces. RDF is often used as a framework for describing Web resources such as the title, author, modification date, content, and copyright information of a Web page and can be illustrated in the following example:

<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:dc="http://rom1023.blogspot.com/">
    <rdf:Description rdf:about=" http://rom1023.blogspot.com/">
        <dc:title>What is your vision on Web 3.0? What are the key technologies that will make it happen?</dc:title>
                <dc:author>Roman</dc:author>
    </rdf:Description>
</rdf:RDF>

This piece of RDF basically says that this article has the title "What is your vision on Web 3.0? What are the key technologies that will make it happen?" and was written by someone whose name is "Roman".

The benefit for using RDF is that the information maps directly and unambiguously to a model which is decentralized and the data represented in RDF form is easy to be processed since RDF is a generic format which already has many parsers. This means that when you have a RDF application, you know which data are the semantics of the application and which data are just syntactic fluff. Thus, the data represented in RDF form can become a part of the Semantic Web.

Another popular Semantic Web technology is the Web Ontology Language (OWL) which is designed for software applications to process the online content instead of just presenting information to humans. OWL can facilitate machines to interpret the Web content better by providing additional vocabulary along with a formal semantics to describe the meaning of the terminology used in Web documents. This helps machines to perform useful reasoning tasks on these Web documents and thus be able to process any Web content that can be published by anyone. OWL provides three sublanguages designed for use by specific communities of implementers and users respectively and they are OWL Lite, OWL DL, OWL Full which is ordered by increasing expressiveness. Each of these sublanguages is an extension of its simpler predecessor. The following set of relations hold but their inverses do not:

  • Every legal OWL Lite ontology is a legal OWL DL ontology.
  • Every legal OWL DL ontology is a legal OWL Full ontology.
  • Every valid OWL Lite conclusion is a valid OWL DL conclusion.
  • Every valid OWL DL conclusion is a valid OWL Full conclusion.
The OWL family of languages support a variety of syntaxes and they are grouped into two categories which are high level syntaxes and exchange syntaxes. High level syntaxes are for specification and exchange syntaxes are more for general use. The below examples, which are taken from the W3C, show the syntax of the OWL2 Functional Syntax which is a high level syntax and OWL2 XML Syntax which is an exchange syntax.

OWL2 Functional Syntax

Ontology(
Declaration( Class( :Tea ) )
)

OWL2 XML Syntax

<Ontology ontologyIRI="http://example.com/tea.owl" ...>
   <Prefix name="owl" IRI="http://www.w3.org/2002/07/owl#"/>
   <Declaration>
     <Class IRI="Tea"/>
   </Declaration>
 </Ontology>

Consider an ontology for tea based on a Tea class as shown above. Every OWL ontology must be identified by an URI (http://www.example.org/tea.owl in this case). To save space, preambles and prefix definitions have been skipped for the above examples.

The main power of Semantic Web languages is that anyone can publish the languages easily by using the RDF that describes a set of URIs without the fear that they might get misinterpreted or stolen and with the knowledge that anyone in the world that has a generic RDF processor can use them. However, although the future of the Semantic Web technologies appears to be bright, they are still very much in their infant stage and there seems to be little consensus about the likely direction and characteristics of the early Semantic Web. Nevertheless, I still believe that the Semantic Web is a vision of the future of the World Wide Web and is what that defines the generation of Web 3.0.


References


0 Comments:

Post a Comment