The Semantic Web Is Sophisticated Syndication (SWISS)
Going beyond Peer-to-Peer with Many-to-Many information sharing.Table of Contents
- The Semantic Web Is Sophisticated Syndication (SWISS)
- Synopsis
- Motivation
- Architecture
- Social Web Services
- Web Clipping
- Identity & Authentication
- Spam
- What's semantic about it?
- DBin
- Getting Graphs
- Related Articles
- RDF & REST are a perfect pair
- Related Services
- Diggo
- ArtFeed
- LOCKSS
- DabbleDB
- CoComment
- Online Copyright Syndication
- VOIP & IM
Synopsis
SWISS enables the disintermediation of WikiWiki, blogging, and other collaborative web application systems.Motivation
Most current web collaboration technology depend on central administration in order to operate (there are a few notable exceptions such as SMTP, and, to an extent, USENET and IRC). While that architecture often suits the commercial interests of the typical operators of such systems, it also fundamentally limits the number of people who can use them. Not only are there intractable issues involving economics and policies (both administrative and editorial[2]), but users also surrender their data freedom and privacy rights to the service provider.This situation is very much like the 1960s in which computer services were centrally controlled with all of the compromises of users' interests which that entails. What users desire is the independence and availability of personal computer desktop systems, particularly for essential applications and their data. Of course they also desire the freedom from maintenance and broad reach of managed web application services.
Another severe flaw in centralized architectures is that they
don't scale effectively. When a system is fortunate enough to be
wildly successful, whether briefly as in the "Slashdot effect" or longer term like MySpace[1], then it
faces a surge in load that is overwhelming and can be extremely
difficult and expensive to even partially satisfy. As Google's
success illustrates, it is essential to have a system architecture
that is inherently distributed in order to reach everyone in the
world (see HowGoogleWorks). Google Desktop also illustrates
that a comprehensive solution must support operations with and
without an Internet connection.
Architecture
SWISS is a technique for sharing information (collaboration) that only requires static content servers for public services. In addition these public servers don't "own" the information they present but instead are primarily used to provide aggregated views of data and topical reference points.The needed dynamic services are scalable ranging from single user (desktop hosted personal web application servers) up to whatever is desired. Well behaved application servers will also not own their data, but rather expose it fully (to those with appropriate permission) in transportable RDF.
The user benefits provided by the editorial and administrative policies of centralized Wiki servers becomes a smorgasbord of opt-in choices from a menu of automated rules and grassroots user affinity groups.
The practice of these policy systems will be a lot like security policies in Internet Explorer. The user gets to choose from a detailed list of decisions, which come in a few precooked prix fixe choices as well as ala carte. Then in enterprise and corporate deployment, those security policy choices and their maintenance for whole groups of users is delegated to IT administrators.
While SWISS operates using polling for syndication in the public Internet. Thus SWISS is extending the successful RSS technology into applications beyond blogs.
But users like the immediacy of the web, and as the number of
users gets large polling runs into latency vs bandwidth &
performance issues (although I have an idea for a
nifty hack), so we also support messaging. While there are
numerous messaging technologies out there, SMTP email is by far the
most successful and must be supported (not surprising since it is
the foundational Internet collaboration medium). Google again is
showing the way in how the web and email can be combined for a
top-notch user experience with GMail. For chat-style application
IRC is the granddaddy, but
XMPP
is probably the way to go for instant messaging (although SIP is important too). The Dylan Wiki
supports XMPP for change notification.
Social Web Services
- Newsgroups
- Chat (IRC, Jabber, Twitter)
- Bookmarks
- Address Book
- Forums
- Blog
- WikiWiki
- Files
- Web Clipping
- Auctions & Trading
- Date & Mate Finding
- Genealogy
- Games (from casual to MMPORPG and first person)
- Contact Networking (Six Degrees (R.I.P.), LinkedIn, Classmates/Reunion)
- Photos & Videos
- Podcast
Web Clipping
- Bloglines has a clipping feature
- http://www.bloglines.com/help/faq#saveditems
- Scrapbook Firefox extension
- https://addons.mozilla.org/firefox/427/
. This tool
would be a great base for an annotation extension for Firefox, except that involves Firefox extension hacking.
- Clipmarks
- http://clipmarks.com/
- Clipmarks Firefox extension
- https://addons.mozilla.org/firefox/1407/
Identity & Authentication
Identity deserves it's own topic.Spam
Automatic spam checking is another big topic. In a SWISS blog or Wiki, managing comment spam is trivial; just drop the offending source from the syndication list and regenerate the page.Spamato is an OSS (GPL) spam
checking framework in Java.
This article discusses processing email messages
with Mule and Spring.
What's semantic about it?
The reason semantic web technology, and particularly RDF, is important is because it provides the data format needed to link together the information needed for social/collaboration applications.Marc Eisenstadt cites some nice resources on how this will work:
http://kmi.open.ac.uk/people/marc/2006/06/29/social-semantic-web-20-peter-mika/. See also Folksonomies.
And the thing that will make this Semantic Web jazz as successful as HTML is the lovely Microformat/CSS friendly Embedded RDF.
DBin
DBin implements an ideal "native" (that is RDF) disintermediated many-to-many data sharing framework.Getting Graphs
Differencing in the sense of Harmony will get the documents into RDF (and back in some cases), while OOHTML will enable RDF to get into documents.Related Articles
Jon Udell talks about OpenFount; a tool for implementing services hosted by a web storage
server (Amazon S3):
http://weblog.infoworld.com/udell/2006/07/07.html#a1483
Amazon now has Simple Queue
Service in
addition to their Simple Storage Service. That would be an
attractive alternative to SMTP (especially since it is widely
blocked now) or OpenFount's OSCS
for AJAX SWISS applications, except
that they currently are only available on separate hosts (thus
inaccessible to ordinary Javascript). Fortunately they know about the issue
and I'm sure it will be resolved (with
hosting options on the accounts I expect).
This article Architectures of Participation: The Next Big
Thing
discusses Metcalfe's law
at work in "Web 2.0" social networking sites. It also
cites YouTube as the "#1" Internet site because of the huge
bandwidth it is using. That illustrates one of the fundamental
weaknesses in any collaboration technology that depends on
centralized administration (and Akamai is only a bandaid).
In this interview of Tim Bray and Radia Perlman Richard MacManus has the
Sun folk espousing exactly the opposite view. They assert that
"true" P2P won't work because people want services that depend on
identity (true) and that requires centralized servers (false). But
this isn't surprising since Tim has already demonstrated he doesn't
get it by supporting Atom and then being a hypocrite by telling folks to not create spurious XML
formats
.
Anyhow, while they can be forgiven for not knowing about SWISS
because it isn't built yet, a clear refutation of the idea that
central servers are needed is DBin.
Marc Eisenstadt blogs on how the Buddy list is key to future publish/aggregate
world.
Actually he says how he doesn't think that is quite true, but links
to those who do such as Steve Boyd who also talks about The Conversational index
.
RDF & REST are a perfect pair
http://blogs.sun.com/bblfish/entry/rest_without_rdf_is_only
Related Services
These are mostly server-mediated services, but are good examples of the kinds of collaboration tools I like.Diggo
Diggo provides some of the information sharing tools that IFCX will have (particularly the Annotea annotation features), but it is a typical centralized service:http://www.diigo.com/

Some articles on Diggo:
http://www.solutionwatch.com/474/diigo-launches-more-than-just-bookmarking/
http://reviews.cnet.com/4531-10921_7-6624081.html?tag=nl.e404
ArtFeed
Easy RSS for artists. http://www.artfeed.org/
LOCKSS
Lots Of Copies Keep Stuff Safe is a distributed digital library project. http://www.lockss.org/
DabbleDB
DabbleDB
CoComment
http://www.cocomment.com/
Online Copyright Syndication
This article

VOIP & IM
While XMPP is suited for web pages, SIP is the open VOIP protocol. The two come together for the ultimate in interactivity.SIP Communicator http://www.sip-communicator.org/ is VOIP in Java (although FLASH
solutions would probably be better for most folks, it even supports
cameras).
[#1] See this Baseline Magazine article "Inside
MySpace" on
MySpaces's ramp up, which also mentions how these scaling issues
stalled Friendster and allowed MySpace to take the lead and achieve
crtical mass.
[#2] Prime examples being disputes over such things as
the application of Wikipedia:Wikipedia:Deletion_policy to most any Wikipedia:Criticism_of_Wikipedia
. That has actually now resulted in
the high visibility fork Wikipedia: Citizendium
which is here http://www.citizendium.org/
.