Recently in On-line research Category

Sente on the iPad

| 22 Comments
Well, Apple has now made the iPad official.  We think it looks like a wonderful device and one that will be very useful in the academic world.  Now many people want to know what our plans are for supporting the iPad.

The bottom line is that we are excited about the new platform and we will release a version of Sente for the iPad as soon as we can.

We are not yet prepared to commit to any particular functionality, but we think some of the key characteristics of iPad Sente are rather obvious, so there is little advantage to keeping them secret.  Here are some of the key features that we expect the application to include:

Mobile libraries synchronized with desktop Sente.  Frankly, many of the design decisions that went into synchronized libraries in Sente 6.0 were made with the rumored tablet in mind and we think Sente's synchronization mechanism is well-suited to the task.  One library, multiple devices.  New references and edits show up everywhere almost immediately.  And "download on demand" for attachments means that you can have as many or as few PDFs on your iPad as you want, and quickly obtain any others as needed with a single click.

On-the-go PDF reading, markup, tagging and note-taking.  The iPad is clearly much better suited to reading PDFs than the iPhone.  To make this most useful, it is essential that users be able to do markup, note taking and tagging as they read.  And, of course, all markup, notes and tags will be quickly and automatically synchronized.

We would not normally make such public statements this early in the development cycle -- one never knows exactly what kinds of unexpected hurdles will be found as we work on the software --  but in this case we think our users should know where we are headed.  We have been hoping for and planning for the iPad for some time, and we are very excited to be able to begin serious development.  

Anticipating the next question... we do not yet know when this application will be released.  When we know more, we will tell everyone, but right now we simply do not have a schedule.

Now we want to hear from you.  Would you like to see Sente on the iPad?  How would you use it?  What do you think the must-have features are?

Michael

Why you should care about APIs

| 2 Comments
Chances are pretty good that you have not really given much thought to Application Programming Interfaces (or APIs) before.  And this is for good reason -- your field of study probably has little to do with computer technology.  But I am hoping you will spend a couple of minutes reading this post to see why APIs actually do matter to you and how they could make your life easier and more productive.

Today, we released Sente 5.7.9.  There were a number of small changes in this release, but the primary reason we had to get this out is because Google changed the structure of the HTML in their Scholar pages and these changes broke our reference detector code (the code that puts the little target icons next to each reference).  So, for the last couple of days Sente users have been unable to get references from Google Scholar pages.

Sente 5.7.9 fixes this problem, but the underlying problem is the lack of a stable API to Google Scholar (and to many other resources).

So, what exactly is an API?  It is an interface for use not by people, but by programs.  A user interface, or UI, is designed to be used by a person. An API is designed to be used by other programs.

An example might help.  If you use Apple's Mail program and you have a MobileMe account, you can access your email either through MobileMe's web-based user interface, or you can read your email in Mail.  The Mail application uses several APIs at MobileMe to get and send mail (e.g., SMTP, IMAP, POP).  If Mail could not use these APIs, it would be forced to try to extract mail messages from the web interface, which would be very difficult, slow and error prone.  And, every time someone at MobileMe tweaked the way the web page looked, the Mail application would have to be modified to handle the changes.

This is where we are with respect to Google Scholar and hundreds of other data sources.  Our targeted browsing code looks at the structure of each web page that it gets and tries to find references embedded within the page. This feature can be very useful, but it is not nearly as useful as it could be if each site supported some standard APIs for searching and retrieving reference data.  And, every time one of the supported sites changes their HTML, we have to modify Sente to handle the change.

Contrast this with PubMed.  PubMed has supported a well-defined, relatively stable API for accessing reference data (and other types of data) for many years now.  This means that Sente can do more things, more quickly and more reliably, with PubMed than we can with most other data sources.  And our users in the bio-medical sciences benefit directly from this.

People in the humanities often accuse developers of academic reference managers like Sente of caring more about biology than about French literature, or medieval art.  This is not the case.  The problem is that the data sources in the humanities (and many other fields) are far, far behind the data sources in biology.  

Instead of treating citation data (titles, author names, abstracts, publication details, etc.) as a little advertisement that will attract readers to their publication, many publishers in the humanities (and other fields) treat citation data as an asset that they can license to services like Thomson and EBSCOhost, who in turn sell access to institutions.  I think it is fair to say that the primary motivation for this way of doing things is not to provide the best support possible for academic research.  (Note that I am not talking about free access to the full text of all articles; just to the basic citation data about each reference.)

Now, you might be thinking that I have confused two issues -- APIs and the licensing of reference data -- and, to some extent I have.  But they are related.  Google provides an API for regular Google searches.  But not for Google Scholar.  Why?  Because the academic publishers would not agree to this because they were afraid of losing their income from licensing their data to resellers.  Thus, we are reduced to trying to keep up with each little change in the HTML that Google makes to their Scholar search results page.

So, back to the original question: why should you care about APIs?  Because the lack of support for open, stable APIs to basic reference data in many fields is holding back everyone  those fields.  We will continue to find ways to make it easier to search the literature in all fields of academic study, but we would be more successful if the publishers actually wanted this to work.

Many of you have, or will have, significant influence on one or more of the academic publishers in your fields.  You may be on the editorial board for a journal.  You decide which journals you submit your manuscripts to.  I think it is time for everyone to use whatever influence they have to push publishers in all academic fields to follow the lead of publishers in the bio-medical sciences.  It is in everyone's best interest (even the publishers) that there be ready access, through both user and programming interfaces, to all of the basic citation data in all academic fields.

Improved Note Taking

| 5 Comments
I would like to say a few words about our latest release, which includes a major update to Sente's note-taking features.

Since the release of version 5.0 in 2007, Sente has included the ability to attach multiple notes to each reference. (Earlier versions restricted you to a single note per reference.) Each note included a title, page range, quotation and comments.

For many users, this has been a valuable feature, and they mention it regularly in posts about Sente. But we have never been particularly fond of Sente's interface for creating and editing notes. It worked, but it was neither attractive nor terribly functional.

In Sente 5.7, we have completely rewritten the notes interface and we are much happier with the results.  In the new interface, it is easy to see more than one note at a time, and we make better use of limited screen space.  Also, it is now possible to copy one or more notes from within the Notes view, and then to paste them into most other applications.

We have also added a feature that makes it simple to create notes from selections in PDFs or web pages. You can select either text or any arbitrary region in either a PDF or a web page, and then, if you click on the Create Note button, Sente will automatically create a new note for the current reference using the selection as the quotation.  Sente will also fill in the page numbers and add a placeholder title for you.

This ability to create notes from PDFs is not intended as a replacement for tools that let you mark-up a PDF, like Preview and Skim.  If what you want to accomplish is to highlight portions of a PDF so that the next time you read the document you will see these annotations, then we would recommend that you use one of these other tools.  (Annotations created in Preview, unlike those created in Skim, will show up in the Links view within Sente, so you will not need to return to Preview to view them.)

These changes should make Sente's notes feature more useful to many of our users.  I would encourage you to watch the new video on our web site and then to download the latest version and give it a try.

Screencast from a Sente User

| No Comments
One of our users, Dr. Kerim Friedman, has put together a nice screencast demonstrating the use of Sente to acquire references from Google Scholar and related PDFs from AnthroSource.  The video is included in one of his posts at Savage Minds, a site dedicated to topics in anthropology.

You can view the screencast here:


The basic flow works in many other fields, as well, so even if you are not an anthropologist, you might find the information valuable.