Browse > Home / Archive by category 'Sorting and Storing'

| Subscribe via RSS

Furl users transfer to diigo

March 18th, 2009 | 2 Comments | Posted by Colin Meek in Featured, Social Networks, Sorting and Storing

diigo The much loved research tool Furl is being absorbed by Diigo – a social bookmarking tool for serious research.
Like many, over the years I’ve found Furl crucial for research when I’ve needed access to saved versions of pages – not just bookmarks. Unlike delicious, Furl let me save whole pages to its servers rather than just the link. It also came with a heap of other tools that let you network with other users. Out of the blue, however, this week Furl’s one million users were told that diigo has acquired Furl after Furl’s owners – LookSmart – changed focus.
‘We worked hard to find Furl a home where loyal users like you could continue to benefit from best-of-breed social bookmarking and annotation tools,’ Furl said. ‘Hands down, Diigo was the winner due to its innovative approach to online research tools and knowledge sharing.’
Again, Diigo is probably a more reliable and flexible research tool than delicious. You can:

  • highlight parts of web pages and archive those section;
  • attach ‘sticky’ notes to pages;
  • save pages and your notes to Diigo’s servers;
  • share saved files to with a project team;
  • network with other Diigo users by contacting them directly or watching what they save; and,
  • explore by tag.

As I’ve stressed before, if you’re involved in serious research one of the biggest problems with delicious is the fact that web links can become inactive very quickly. If you need reliable access to your source material quickly – you need another solution. I’ve no experience with Diigo so I’ll monitor its service over the next few weeks.

In the meantime, if you’re a Furl user, Diigo has set up an easy way to transfer your archive. More on Diigo soon.

Tags: , , , , , ,

Notebook options: Yahoo Search Pad; Snapbits; and, notebook G

February 15th, 2009 | No Comments | Posted by Colin Meek in Featured, Search tools and tricks, Sorting and Storing, Your own privacy

I’ve covered notebooks such as Evernote in some depth but there are a range of other tools available that help you organise clips, bookmarks, notes, documents and screenshots in ‘notebooks’.

Here are three others:

notebook G
notebook-g “notebook G is an online personal information organizer you can use to keep track of the various notes for home, school and work. Use notebook G as a replacement for a paper-based spiral notebook, or as a supplement to paper-based organizers.
Since notebook G is web-based, your data follows you wherever you go. And your data is protected from viruses and spyware, as well as backed up to a world-class data protection firm.”

SnapBits
SnapBits is a free service that may be used to securely store, search, sort and easily retrieve “Bits” of information. These “Bits” may be anything you wish to remember or keep a note of. You may store as many as you like and unlike conventional paper notes, you will never lose or fail to find your information.”

Yahoo’s Search Pad
Yahoo’s notebook application is due to be released soon. Here’s a preview:

Tags: , , , , , ,

Easy transfer from Google Notebook to Evernote and Zoho

January 29th, 2009 | No Comments | Posted by Colin Meek in Sorting and Storing

If, like me, you were shocked by Google’s announcement that they are to abandon development of its highly rated Notebook app – help it at hand. Two of the best notebooks have launched importers that enable evernotelogoGoogle Notebook users to easily switch to Evernote or Zoho.
Evernote’s instructions include a video tutorial while over at Zoho you’ll need a simple Firefox plugin.

If you haven’t tried a notebook app before then see my post on the 10 reasons why journalists and researchers should love Evernote.

Tags: , , ,

Nepomuk – the social semantic desktop

November 27th, 2008 | No Comments | Posted by Colin Meek in Semantic web, Social Networks, Sorting and Storing

Last month I talked to John Breslin about how web applications may become semantically integrated with your desktop applications. In other words, the ‘semantic web’ isn’t just about improving search, it is also about allowing all the systems you use to interoperate with each other. For example, imagine browsing your RSS feeds and a name of a post author is highlighted on your screen. Your desktop computer then makes connections between that person and your contacts file and your friends in social networking sites.

John and I could have been talking about Nepomuk – the social semantic desktop, which was reviewed by physorg.com this week. This EU-funded project aims to apply the kind of semantic solutions being deployed on the web to your desktop allowing your desktop to more easily process the information it has across different platforms, media types and applications.

The practical result could be that you will be able to access information on your desktop using semantic technology in the same way that you will be able to make connections semantically online.

Tags: , , , , , ,

The Semantic Web today – An Interview with John Breslin

Not many people are as close to the heart of the Semantic Web as John Breslin. John is the founder of the Semantically-Interlinked Online Communities (SIOC) project, a member of the W3C Advisory Committee, lecturer at the National University of Ireland and an associate researcher on the semantic web at the Digital Research Institute in Galway.

I caught up with John recently for this analysis of the semantic web and journalism published in journalism.co.uk. We thought John’s points were so interesting that we’ve brought you the full interview in this post.

Niche social networks…
Q: Some have predicted that the rise in the universal social network sites such as Facebook and Myspace will be mirrored soon by the explosion in growth of niche sites such as Xing and Peer Trainer and the expanding interest in ‘enterprise2.0′. In many ways this may accelerate the the demand for semantic-type applications that allow people to travel seamlessly through various social networking services. What do you think?

John Breslin: I think that even though some have argued against the need for niche social networking services (SNSs) due to the widespread use of large sites like Facebook and MySpace, these niche SNSs can provide a breath of fresh air when one wants to escape from the bigger “overcrowded SNS cities”. As long as a niche SNS or community site provides regularly updated and relevant content to a steady or growing set of users, there is no reason that such sites should not survive or even flourish on the Web. As pointed out by Paul Gibler in his online article “The Expanding World of Social Networking”, it is the fine-grained and targeted communities such as CafeMom, BOOMj and PEERtrainer that are experiencing recent growth. This also ties in with the idea of object-centered sociality, where people don’t just connect randomly online but rather through the (niche) interests that they have in common. Mark O’Neill sums it up nicely: “…by organizing networks centrifugally around objects, social networking sites have meaning, even when they do not have 200 million users and even when they are centered around minority interests (like Thomas Kinkade paintings!). The point is that they are centered on objects which are in common.” As you say, a key is to allow people to seamlessly find and navigate through these niche interests, and that’s where projects like OpenID, FOAF and SIOC can help – from the point of view of having a single login that’s tied to your interests which can then be semantically matched to content items created across many communities.

Social network portability…
Q: There are several projects set up to address the issue of social network portability – allowing you to interact with various social networks more easily. In your view, will most people need to get used to the concept of a single global online identity such as FOAF?

John Breslin: I think that people are tired of repeating the same information in multiple places, and through standard signon systems like OpenID and profile representation mechanisms like FOAF, you can allow someone to define their identity and to reuse it wherever they choose to use it.

Tech stacks…
Q: You’ve described how a ‘social networking layer may be folded into tech stacks’ where your web and desktop application layer can tap into an integrated social networking stack. For me, this opened my eyes to how important the shift to the semantic will be. I think many people assume that the semantic web will usher in a new period of improved search. But, in fact, it will utterly change the we we interact with the internet?

John Breslin: A lot of the focus from the public or media regarding the semantic web has been in relation to search. But it’s not solely about finding those relevant objects (people, places, etc.) through “Google killers”, and its not only about the Internet (despite being called Web3.0!), but it’s also about providing ways to allow systems (on the desktop, or the Web, or media servers, whatever) to interoperate with each other as well. The social networking stack is one nice example, and indeed efforts like the Social Semantic Desktop and Social Semantic Web can interoperate through such a stack. It may also be for migration between different collaborative workspaces or social software systems, as we are doing with the SIOC project.

Your online identity…
Q: You’ve also suggested that online communities should provide their data in a common, machine-understandable way and should use common semantics to define this data (SIOC and FOAF). The way semantic services will be deployed is unpredictable but do you envisage people signing up to new social networks and setting up a profile automatically using their FOAF file? In the future, do you think people who want to network with each other will swap FOAF files and these files will include relevant information about social network membership?

John Breslin: Yes, and this is being done to some extent already. But also it’d be nice to not just bring your personal profile and your friends with you (for example, via FOAF) but perhaps your content as well (maybe defined using SIOC). There are some issues related to both transporting your friends (need their permission) and comments attached to your content (may need the permission of those commenters too), but you should at the very least be able to bring what belongs to you (your profile and your content), for example along the guidelines of the “Bill of Rights for Users of the Social Web” by Canter et al.

Meshing of networks…
Q: A practical consequence of SIOC might be that you might do a search in Facebook using the term ‘bog-snorkelling’ and gets results back that may include profile pages that include that term, but also blog results from Technorati, comments from Flickr albums and YouTube videos? Equally, a practical consequence of SKOS, FOAF and SIOC could be that you click on a tag for ‘bog snorkelling’ in Delicious and get results from a range of social network sites?

John Breslin: Exactly! I’m delighted that Yahoo! SearchMonkey have listed SIOC as one of their recommended vocabularies – and that people are now starting to get the idea of being able to retrieve user-generated content items from all or from specific types of social websites (blogs, forums, mailing lists, photo albums) using mechanisms like SIOC and FOAF. Through people defining interests explicitly using something like a foaf:interest field or implicitly by clicking on tags of interest, relevant content can be easily returned from social websites with appropriate dc:subject or sioc:topic metadata.

Practical implications…
Q:A practical result being that you create a new account with a new social network and that SN can identify other people on that Network who are listed in Bob’s defined relationships. Have any social networks already deployed this service?

John Breslin: There are many sites (e.g. Dopplr) that are starting to allow you to bring your friends with you by specifying something like your GMail account details (and then matching e-mail addresses you use) or your Twitter account details (and then retrieving a list of those whose microblogs you follow), but it is certainly useful to have a smaller set of reusable relationship formats that can make this more widespread (and that extends the number of services that you can import from). The Google SocialGraph API is a nice example of something that can enable this, as it allows applications to reuse social graph information extracted from sources all over the Web and represented using the open formats XFN and FOAF.

Searching the semantic layer…
Q: I’m a bit confused by the SIOC RDF Browser and if there are any applications that currently allow one to browse information expressed in RDF and SIOC ontology – I assume you need specific URLs to use this?

The SIOC RDF browser is simply a way to view RDF information in a more human friendly form. One of the motivations for creating this was to enable people to view semantic information easily because it may have different aspects that can be of interest – it may be the same information you see on a normal web page, but it may also contain extra information that is not normally displayed on a web page but is rather hidden or locked into a database and that information may prove useful for some third-party applications (e.g. a modification date, incoming links), or perhaps some extra information can be calculated or inferred for a semantic page (related content on the same topic, tag usage frequencies, etc.)

Semantic search…
Q: From the perspective of the non-technical lay researcher – where does Sindice (the semantic web index) and other semantic search tools fit in?

Sindice can be thought of as a big semantic index of the Web. It allows you to find pointers to relevant pages or URIs where particular keywords are mentioned, where certain property values are used (e.g. pages where a person says their e-mail address is john.breslin@deri.org), or where certain facts or semantic tripples appear. If you’re looking for a “semantic search engine”, it depends on what you need. Sindice gives you pointers to where stuff is, whereas many other engines give you the stuff as well (without you having to go to the source page).
SWSE (also from DERI) and Swoogle allow query capabilities over the collections of all Semantic Web statements – so if you search for Galway, it can show you the relevant statements as well as pointing you to the pages they were obtained from.

But I think the applications of Sindice, i.e. finding pointers to where stuff is, and using that in third-party applications, are quite interesting. For example, the SIOC Widget for WordPress is powered through a combination of distributed SIOC documents and the Sindice index. So, when you are browsing a blog that has this widget installed, you may see little balloons appearing beside commenters names. Clicking on these balloons shows a pop-up with a list of content (posts, comments, topics) that that commenter has created not just on the blog site you are viewing but across a range of SIOC-enabled websites (blogs, forums, mailing lists, whatever) as indexed in Sindice. Here is a picture. So you can see and navigate to the content a person has created across a range of sites from just one place that they post to.

On the cusp…
Q:Moving on to practical applications. I was interested to read the “On the cusp” by David Provost. In it he concludes that companies are on the verge of constructing very practical and commercially viable semantic applications. Do you agree?

John Breslin: I think that we are now beginning to see the real commercial applications of what can be done when all kinds of things on the Web are connected together using semantics. This is obvious in the attention being given to startup companies in this space like Powerset, Metaweb (Freebase) and Radar Networks (Twine), and also since many big companies including Reuters (Calais API), Yahoo! (SearchMonkey) and Google (Social Graph API) have all announced in 2008 what they are doing with semantic data.

There has been a lot of talk this past year about the social graph (notably from Google’s Brad Fitzpatrick), which looks at how people are connected together (friends, colleagues, neighbours, etc.), and how such connections can be leveraged across websites. In the Semantic Web, it is not just people who are connected together in some meaningful way, but documents, events, places, hobbies, pictures, you name it! And it is the commercial applications that exploit these connections that are now becoming interesting. But it is very important that the users aren’t exposed to any RDF or semantic terminology – through usage, they just “get” the fact that everything is interconnected.

And the best product?…
Q: In your view – what are the most exciting semantic product developments to have emerged in the last year?

John Breslin: I really like Radar’s Twine, the “knowledge networking” application that allows users to share, organise, and find information with people they trust. I find Twine very interesting, and as well as using it to gather information about SIOC for regular blog entries I write (“Tales from the SIOC-o-sphere”), I also use it to gather and publish personal interests that I think will be of interest to the public, and for passing on interesting stuff to work colleagues.

Privacy…
Q: What about the privacy angle. Are the privacy safeguards in place capable evolving to meet this challenge? Does the average LiveJournal user know that their profiile has been converted to a FOAF file and is now translatable by any number of new semantic products? Speaking as a journalist, my hunch is that the vast majority of people are going to be surprised and, perhaps, shocked to know that a public comment then make on Livejournal may end up in a database that is searchable by people in Linkedin.

No, certainly people aren’t aware that many sites are making semantic forms of their content available which can be reused elsewhere. Tribe.net recently turned off their FOAF exports after a user complained that his/her profile was being copied for use elsewhere (the original developer team had moved on so the new developers weren’t sympathetic to the possibilities of the Semantic Web). Similar things happened with people blogging and finding that content from their RSS feeds was popping up on other sites. There certainly has to be more thought put into educating users and towards having opt-in-opt-out mechanisms when implementing semantic exports, especially for personal content and profiles.

Thanks to John for his time with this.

Tags: , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,

When you have to cite your online sources

October 20th, 2008 | 3 Comments | Posted by Colin Meek in Featured, investigative strategies, Sorting and Storing

There is no doubt that the many social bookmarking tools that exist often offer great networking services and ways you can monitor areas of interest effectively. But you can run into problems if you rely on a social bookmarking site as a file to store away your online sources. If you are working on a sensitive story that relies on web-sourced evidence then that can evaporate if the site owner pulls the page, alters the relevant text or changes an image.

I worked for years on the BMJ Best Treatments project and out-of-date links to academic sources and public-targeted government advice were a continual headache. One study has found that 13% of internet references in scholarly articles become inactive after only 27 months. But perhaps an even bigger problem is when the link stays intact but the content changes. If your story depends on a particular source and it changes then where do you turn? Does your publication have a policy on this?

Some social bookmarking sites offer a partial solution. Furl.net lets you save the entire page to Furl’s servers where you can access an exact copy of the page without going to the source URL. I always use Furl when I’m working on anything even remotely sensitive. Another tool – WebCite – allows you to copy web pages and store them remotely. The difference here is that WebCite gives you a way of citing a source permanently. When you use WebCite in a published document you cite the original source URL and a WebCite reference and you can be sure the WebCite link won’t change. Readers can then click on the WebCite link for the archived version. WebCite is supported by a range of academic and scientific publishers who already have an incentive to keep the service running. Check out its FAQ for information about its funding and security.

While many journalists find services and tools such as the Way Back Machine useful there is no way you can rely on it to cite a source. WebCite offers a range of other tools including a ‘WebCite This’ button for bloggers. See here for more on that.

Tags: , ,

EverNote – 10 reasons why journalists and researchers should love it

October 2nd, 2008 | 5 Comments | Posted by Colin Meek in Filter content, Social Networks, Sorting and Storing

Over the last few years a number of notebook applications have elbowed their way into the mainstream market including Google Notebook, Zoho Notebook, Ubernote and iLeonardo. A few of these apps were discussed in this Mashable post recently. One of the most recent note-taking apps to emerge is Evernote – an incredibly powerful creation that works on both Macs and Windows. I’ve looked at Evernote from the perspective of a journalist and researcher and found ten reasons to love it.

1 – It does the basics really well. I hope the people at EverNote never lose sight of this. Journalists and researchers use notebook apps so they can quickly and efficiently file away snippets, web pages, documents and pictures into subject files. They also need to be able to dump this content into the note-taking app in a variety of ways and access those notes easily. Evernote has this mastered.
2 – EverNote works on the web and your desktop. Sounds like a source of confusion but the automatic syncing means both operate seamlessly. You can, however, create notebooks only on the web version or only on your desktop.
3 – You can publish your notebooks to collaborate with colleagues on projects.
4 – You can tag notes. Making it easier to search for notes across and within notebooks.
5 – Notes can be emailed easily. Single or multiple notes can be selected and then one click attaches those notes as PDFs to a new message window from your email client.
6 – You can email notes directly into your EverNote account.
7 – EverNote search includes image recognition technology. While not universally applauded, this image recognition capability means you can automatically search through pictures that carry text. If you have taken snaps of business cards, for example, you can search them automatically.
8 – You can import and export. This means that as you build data using the service and catalogue items you can export them later to a different app on a different platform.
9 – The web and desktop interfaces work beautifully. You can view notes in various forms, zoom in and out, view specific notes in one column or work on a specific note by adding text or images.
10 – It’s about to get better. Yesterday EverNote announced released its all-access API which means that developers can mash the service with other apps. The company hopes, for example, that it can be integrated with scheduling services such as Ical and Remember the Milk. This also means that the social bookmarking element of EverNote (published notebooks) may become intergrated with other social networking sites. It also says that new Mac and Windows scripting capabilities mean that you can automate specific processes and add functionality such as RSS feeds

There are other reasons why EverNote has attracted praise; for example, I’ve yet to explore how easily you can create and browse notes via mobiles. One downside might be that the free version allows a maximum data upload of 40mb per month. Even so, $5 will give you access to 500mb which is enough to store 5,000 snaps.

More on Evernote mashups soon.

Tags: , , , ,

Notebook 2.0: 12 Tools for Researchers – Mashable

September 30th, 2008 | No Comments | Posted by Colin Meek in Sorting and Storing

Here’s a useful guide to the various ‘notebook’ options available. Most of these allow you to save whole web pages or page clips, emails, pictures and other content to ‘notebooks’ organised by subject. I’m using Evernote which I’ll be reviewing soon. 

Notebook 2.0: 12 Tools for Researchers – Mashable -

Tags: ,

Text 2 Mind Map

September 22nd, 2008 | No Comments | Posted by Colin Meek in Asides, Sorting and Storing

 Text 2 Mind Map – The text-to-mind-map converter – If, like me, you permanently have several mind-maps littering your desk then you might find Text 2 Mind Map helpful.

Tags:

Welcome

Welcome to insite. A www.journalism.co.uk blog that will cover everything related to internet research. insite is written and edited by Colin Meek who delivers training courses in Advanced Internet Research for journalism.co.uk the National Union of Journalists and for other clients on an in-house basis.