Search For:

Displaying 1-50 out of 59 total
Why web sites are lost (and how they're sometimes found)
Found in: Communications of the ACM
By Catherine C. Marshall, Frank McCown, Frank McCown, Frank McCown, Michael L. Nelson, Michael L. Nelson, Michael L. Nelson
Issue Date:November 2009
pp. 141-145
Introduction The web is in constant flux---new pages and Web sites appear daily, and old pages and sites disappear almost as quickly. One study estimates that about two percent of the Web disappears from its current location every week.2 Although Web users...
     
NCSTRL+: Adding Multi-Discipline and Multi-Genre Support to the Dienst Protocol Using Clusters and Buckets
Found in: Advances in Digital Libraries Conference, IEEE
By Michael L. Nelson, Kurt Maly, Stewart N.T. Shen, Mohammad Zubair
Issue Date:April 1998
pp. 128
We describe NCSTRL+, a unified, canonical digital library for scientific and technical information (STI). NCSTRL+ is based on the Networked Computer Science Technical Report Library (NCSTRL), a World Wide Web (WWW) accessible digital library (DL) that prov...
 
Adaptive Networks of Smart Objects
Found in: Parallel Processing Workshops, International Conference on
By Johan Bollen, Michael L. Nelson
Issue Date:August 2002
pp. 487
We propose the implementation of a distributed system for DL storage and retrieval which relies on two earlier developed technologies: buckets and document linking based on user retrieval patterns. Buckets are expanded with a mechanism to keep track of wei...
 
The Dienst-OAI Gateway
Found in: Digital Libraries, Joint Conference on
By Terry L. Harrison, Michael L. Nelson, Mohammad Zubair
Issue Date:May 2003
pp. 309
Though the Open Archive Initiative Protocol for Metadata Harvesting (OAI-PMH) is becoming the de facto standard for digital libraries, some of its predecessors are still in use. Although a limited number of Dienst repositories continue to be populated, oth...
 
Distributed Computing Architectures for Digital Libraries Workshop
Found in: Parallel Processing Workshops, International Conference on
By Johan Bollen, Michael L. Nelson
Issue Date:August 2002
pp. 429
No summary available.
   
Video recommendations for the open video project
Found in: Digital Libraries, Joint Conference on
By Raquel Araujo, Gary Geisler, Michael L. Nelson, Johan Bollen
Issue Date:June 2005
pp. 369-369
We describe a DL multimedia recommender system implemented for the Open Video project. Recommendations are generated by a spreading activation algorithm operating on a video network created from log download sequences. We compared the system's recommendati...
 
Toolkits for Visualizing Co-Authorship Graph
Found in: Digital Libraries, Joint Conference on
By Xiaoming Liu, Johan Bollen, Michael L. Nelson, Herbert Van de Sompel, Jeremy Hussell, Rick Luce, Linn Marks
Issue Date:June 2004
pp. 404-404
Visualization eases insight into complex systems such as co-authorship networks. We present an initial deployment of an author navigator application for convenient visual examination of JCDL and LANL co-authorship networks.
   
Failure Planning in STESCA
Found in: Computer Software and Applications Conference, Annual International
By Michael L. Nelson, Guillermo Flores Jr
Issue Date:October 1999
pp. 334
Mission specification for autonomous or robotic vehicles can be fairly straightforward if everything goes well. However, if part of the mission fails for any reason, the vehicle must have some type of backup plan. Unfortunately, this failure planning adds ...
 
A Design Pattern for Autonomous Vehicle Software Control Architectures
Found in: Computer Software and Applications Conference, Annual International
By Michael L. Nelson
Issue Date:October 1999
pp. 172
Design patterns represent a generalized approach to solving a related set of problems. Typically, a design pattern does not provide reusable code. Instead, it provides a common vocabulary and a generalized approach in an application-independent manner. We ...
 
If you harvest arXiv.org, will they come?
Found in: Digital Libraries, Joint Conference on
By Johan Bollen, Michael L. Nelson
Issue Date:June 2005
pp. 393-393
We examine which NASA Technical Report Server (NTRS) repositories have received the most downloads during 15 months of operation. In particular, we explore the collection development policy of including non-NASA scientific, technology and medicine (STM) re...
 
The OAI-PMH NASA Technical Report Server
Found in: Digital Libraries, Joint Conference on
By Michael L. Nelson, JoAnne R. Calhoun, Calvin E. Mackey
Issue Date:June 2004
pp. 400-400
The NASA Technical Report Server (NTRS) is now based on the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH). This metadata harvesting version of NTRS represents a significant improvement over the previous distributed searching implement...
   
Repository Synchronization in the OAI Framework
Found in: Digital Libraries, Joint Conference on
By Xiaoming Liu, Kurt Maly, Mohammad Zubair, Michael L. Nelson
Issue Date:May 2003
pp. 191
The Open Archives Initiative Protocol forMetadata Harvesting (OAI-PMH) began as an alternative to distributed searching of scholarly eprint repositories. The model embraced by the OAI-PMH is that of metadata harvesting, where value-added services (by a
 
Arc: An OAI Service Provider for Cross-Archive Searching
Found in: Digital Libraries, Joint Conference on
By Xiaoming Liu, Kurt Maly, Mohammad Zubair, Michael L. Nelson
Issue Date:June 2001
pp. 65-66
The usefulness of the many on-line journals and scientific digital libraries that exist today is limited by the lack of a service that can federate them through a unified interface. The Open Archive Initiative (OAI) is one major effort to address technical...
   
A Bucket Architecture for the Open Video Project
Found in: Digital Libraries, Joint Conference on
By Michael L. Nelson, Gary Marchionini, Gary Geisler, Meng Yang
Issue Date:June 2001
pp. 310-311
The Open Video project is a collection of public domain digital video available for research and other purposes. The Open Video collection currently consists of approximately 350 video segments, ranging in duration from 10 seconds to 1 hour. Rapid growth f...
   
Object-Oriented Robotics
Found in: Object-Oriented Real-Time Dependable Systems, IEEE International Workshop on
By Michael L. Nelson
Issue Date:January 1999
pp. 234
The real-time control of a robotic vehicle is a highly complex task. The vehicle consists of several components, all of which must be controlled both individually and collectively for the vehicle to operate properly. The control of individual components ge...
 
Support issues in distance learning: supporting teachers and learners
Found in: Proceedings of the 43rd annual southeast regional conference - Volume 1 (ACM-SE 43)
By Don Goff, Ken Hoganson, Michael L. Nelson, Nelson C. Baker, Solomon Negash
Issue Date:March 2005
pp. 27-28
Distance Learning physically separates the learner from the teacher often forfeiting the traditional support available in a face-to-face model. What are the issues and lessons learned in supporting teachers and learners in Distance Learning? The panelists ...
     
A Software Control Architecture for Autonomous Vehicles
Found in: Hawaii International Conference on System Sciences
By Michael L. Nelson
Issue Date:January 1998
pp. 226
The Strategic-Tactical-Execution Software Control Architecture (STESCA) is used to control Autonomous Vehicles. STESCA, a tri-level approach, uses the principle of abstraction to simplify the problems of mission and vehicle control. The three levels of STE...
   
Unsupervised creation of small world networks for the preservation of digital objects
Found in: Proceedings of the 2009 joint international conference on Digital libraries (JCDL '09)
By Charles L. Cartledge, Michael L. Nelson
Issue Date:June 2009
pp. 1-1
The prevailing model for digital preservation is that archives should be similar to a "fortress": a large, protective infrastructure built to defend a relatively small collection of data from attack by external forces. Such projects are a luxury, suitable ...
     
Self-arranging preservation networks
Found in: Proceedings of the 8th ACM/IEEE-CS joint conference on Digital libraries (JCDL '08)
By Charles L. Cartledge, Michael L. Nelson
Issue Date:June 2008
pp. 597-617
We pose the question: what if digital library objects could self-arrange without intervention from repositories and minimal intervention from administrators? We present background information about networks, techniques on how networks can be created based ...
     
Just-in-time recovery of missing web pages
Found in: Proceedings of the seventeenth conference on Hypertext and hypermedia (HYPERTEXT '06)
By Michael L. Nelson, Terry L. Harrison
Issue Date:August 2006
pp. 145-156
We present Opal, a light-weight framework for interactively locating missing web pages (http status code 404). Opal is an example of "in vivo" preservation: harnessing the collective behavior of web archives, commercial search engines, and research project...
     
Resources for teaching web science to computer science undergraduates (abstract only)
Found in: Proceedings of the 45th ACM technical symposium on Computer science education (SIGCSE '14)
By Frank McCown, Michael L. Nelson
Issue Date:March 2014
pp. 725-725
This poster reports on course materials that have been developed for teaching an Introduction to Web Science course to computing majors at the undergraduate level. This is the first comprehensive collection of resources that we are aware of that targets un...
     
Semi-automated rediscovery of lost YouTube music videos
Found in: Proceedings of the 13th ACM/IEEE-CS joint conference on Digital libraries (JCDL '13)
By Daniel Sebastian, Frank McCown, Michael L. Nelson
Issue Date:July 2013
pp. 421-422
Users frequently post popular material to YouTube, and in response, others link to these videos from social media, blogs, forums, and email. However, this content may be removed for numerous reasons, only to resurface again at another URL. This continuous ...
     
A memento web browser for iOS
Found in: Proceedings of the 13th ACM/IEEE-CS joint conference on Digital libraries (JCDL '13)
By Frank McCown, Heather Tweedy, Michael L. Nelson
Issue Date:July 2013
pp. 371-372
The Memento framework allows web browsers to request and view archived web pages in a transparent fashion. However, Memento is still in the early stages of adoption, and browser-plugins are often required to enable Memento support. We report on a new iOS a...
     
ArcLink: optimization techniques to build and retrieve the temporal web graph
Found in: Proceedings of the 13th ACM/IEEE-CS joint conference on Digital libraries (JCDL '13)
By Ahmed AlSum, Michael L. Nelson
Issue Date:July 2013
pp. 377-378
We present ArcLink, a proof-of-concept system that complements open source Wayback Machine installations by optimizing the construction, storage, and access to the temporal web graph. We divide the web graph construction into four stages (filtering, extrac...
     
Access patterns for robots and humans in web archives
Found in: Proceedings of the 13th ACM/IEEE-CS joint conference on Digital libraries (JCDL '13)
By Michael L. Nelson, Michele C. Weigle, Yasmin A. AlNoamany
Issue Date:July 2013
pp. 339-348
Although user access patterns on the live web are well-understood, there has been no corresponding study of how users, both humans and robots, access web archives. Based on samples from the Internet Archive's public Wayback Machine, we propose a set of bas...
     
Reading the correct history?: modeling temporal intention in resource sharing
Found in: Proceedings of the 13th ACM/IEEE-CS joint conference on Digital libraries (JCDL '13)
By Hany M. SalahEldeen, Michael L. Nelson
Issue Date:July 2013
pp. 257-266
The web is trapped in the "perpetual now", and when users traverse from page to page, they are seeing the state of the web resource (i.e., the page) as it exists at the time of the click and not necessarily at the time when the link was made. Thus, a tempo...
     
Evaluating sliding and sticky target policies by measuring temporal drift in acyclic walks through a web archive
Found in: Proceedings of the 13th ACM/IEEE-CS joint conference on Digital libraries (JCDL '13)
By Michael L. Nelson, Scott G. Ainsworth
Issue Date:July 2013
pp. 39-48
When a user views an archived page using the archive's user interface (UI), the user selects a datetime to view from a list. The archived web page, if available, is then displayed. From this display, the web archive UI attempts to simulate the web browsing...
     
An evaluation of caching policies for memento timemaps
Found in: Proceedings of the 13th ACM/IEEE-CS joint conference on Digital libraries (JCDL '13)
By Justin F. Brunelle, Michael L. Nelson
Issue Date:July 2013
pp. 267-276
As defined by the Memento Framework, TimeMaps are machine-readable lists of time-specific copies -- called "mementos" -- of an archived original resource. In theory, as an archive acquires additional mementos over time, a TimeMap should be monotonically in...
     
Synchronicity: automatically rediscover missing web pages in real time
Found in: Proceeding of the 11th annual international ACM/IEEE joint conference on Digital libraries (JCDL '11)
By Martin Klein, Michael L. Nelson, Moustafa Aly
Issue Date:June 2011
pp. 475-476
Missing web pages (pages that return the 404 "Page Not Found" error) are part of the browsing experience. The manual use of search engines to rediscover such pages can be frustrating and unsuccessful. We introduce Synchronicity, a Mozilla Firefox add-on th...
     
Persistent annotations deserve new URIs
Found in: Proceeding of the 11th annual international ACM/IEEE joint conference on Digital libraries (JCDL '11)
By Abdulla Alasaadi, Michael L. Nelson
Issue Date:June 2011
pp. 195-198
Some digital libraries support annotations, but sharing these annotations with other systems or across the web is difficult because of the need of special applications to read and decode these annotations. Due to the frequent change of web resources, the a...
     
Rediscovering missing web pages using link neighborhood lexical signatures
Found in: Proceeding of the 11th annual international ACM/IEEE joint conference on Digital libraries (JCDL '11)
By Jeb Ware, Martin Klein, Michael L. Nelson
Issue Date:June 2011
pp. 137-140
For discovering the new URI of a missing web page, lexical signatures, which consist of a small number of words chosen to represent the "aboutness" of a page, have been previously proposed. However, prior methods relied on computing the lexical signature b...
     
How much of the web is archived?
Found in: Proceeding of the 11th annual international ACM/IEEE joint conference on Digital libraries (JCDL '11)
By Ahmed Alsum, Hany SalahEldeen, Michael L. Nelson, Michele C. Weigle, Scott G. Ainsworth
Issue Date:June 2011
pp. 133-136
The Memento Project's archive access additions to HTTP have enabled development of new web archive access user interfaces. After experiencing this web time travel, the in- evitable question that comes to mind is "How much of the Web is archived?" This ques...
     
Evaluating methods to rediscover missing web pages from the web infrastructure
Found in: Proceedings of the 10th annual joint conference on Digital libraries (JCDL '10)
By Martin Klein, Michael L. Nelson
Issue Date:June 2010
pp. 59-68
Missing web pages (pages that return the 404 "Page Not Found error) are part of the browsing experience. The manual use of search engines to rediscover missing pages can be frustrating and unsuccessful. We compare four automated methods for rediscovering w...
     
Comparing the performance of us college football teams in the web and on the field
Found in: Proceedings of the 20th ACM conference on Hypertext and hypermedia (HT '09)
By Martin Klein, Michael L. Nelson, Olena Hunsicker
Issue Date:June 2009
pp. 70-73
In previous research it has been shown that link-based web page metrics can be used to predict experts' assessment of quality. We are interested in a related question: do expert rankings of real-world entities correlate with search engine (SE) rankings of ...
     
Correlation of music charts and search engine rankings
Found in: Proceedings of the 2009 joint international conference on Digital libraries (JCDL '09)
By Martin Klein, Michael L. Nelson, Olena Hunsicker
Issue Date:June 2009
pp. 1-1
We investigate the question whether expert rankings of real-world entities correlate with search engine (SE) rankings of corresponding web resources. We compare Billboards "Hot 100 Airplay" music charts with SE rankings of associated web resources. Out of ...
     
Inter-search engine lexical signature performance
Found in: Proceedings of the 2009 joint international conference on Digital libraries (JCDL '09)
By Martin Klein, Michael L. Nelson
Issue Date:June 2009
pp. 1-1
We generate lexical signatures (LSs) from web pages and acquire the mandatory document frequency values from three dierent search engine (SE) indexes. We cross-query the LSs against the two SEs they were not generated from and compare the retrieval perform...
     
A framework for describing web repositories
Found in: Proceedings of the 2009 joint international conference on Digital libraries (JCDL '09)
By Frank McCown, Michael L. Nelson
Issue Date:June 2009
pp. 1-1
In prior work we have demonstrated that search engine caches and archiving projects like the Internet Archive's Wayback Machine can be used to "lazily preserve" website and reconstruct them when they are lost. We use the term "web repositories" for collect...
     
What happens when facebook is gone?
Found in: Proceedings of the 2009 joint international conference on Digital libraries (JCDL '09)
By Frank McCown, Michael L. Nelson
Issue Date:June 2009
pp. 1-1
Web users are spending more of their time and creative energies within online social networking systems. While many of these networks allow users to export their personal data or expose themselves to third-party web archiving, some do not. Facebook, one of...
     
Using timed-release cryptography to mitigate the preservation risk of embargo periods
Found in: Proceedings of the 2009 joint international conference on Digital libraries (JCDL '09)
By Michael L. Nelson, Rabia Haq
Issue Date:June 2009
pp. 1-1
Due to temporary access restrictions, embargoed data cannot be refreshed to unlimited parties during the embargo time interval. A solution to mitigate the risk of data loss has been developed that uses a data dissemination framework, the Timed-Locked Embar...
     
A comparison of techniques for estimating IDF values to generate lexical signatures for the web
Found in: Proceeding of the 10th ACM workshop on Web information and data management (WIDM '08)
By Martin Klein, Michael L. Nelson
Issue Date:October 2008
pp. 1-2
For bounded datasets such as the TREC Web Track the computation of term frequency (TF) and inverse document frequency (IDF) is not difficult. However, since IDF cannot be directly calculated for the entire web, it must be estimated. We see a need to estima...
     
Usage analysis of a public website reconstruction tool
Found in: Proceedings of the 8th ACM/IEEE-CS joint conference on Digital libraries (JCDL '08)
By Frank McCown, Michael L. Nelson
Issue Date:June 2008
pp. 597-617
The Web is increasingly the medium by which information is published today, but due to its ephemeral nature, web pages and sometimes entire websites are often "lost" due to server crashes, viruses, hackers, run-ins with the law, bankruptcy and loss of inte...
     
Recovering a website's server components from the web infrastructure
Found in: Proceedings of the 8th ACM/IEEE-CS joint conference on Digital libraries (JCDL '08)
By Frank McCown, Michael L. Nelson
Issue Date:June 2008
pp. 597-617
Our previous research has shown that the collective behavior of search engine caches (e.g., Google, Yahoo, Live Search) and web archives (e.g., Internet Archive) results in the uncoordinated but large-scale refreshing and migrating of web resources. Intera...
     
Augmenting OAI-PMH repository holdings using search engine APIs
Found in: Proceedings of the 2007 conference on Digital libraries (JCDL '07)
By Juliet Z. Pao, Martin Klein, Michael L. Nelson
Issue Date:June 2007
pp. 486-486
In this poster, we give the preliminary results of our project to acquire Atmospheric Science Data Center (ASDC) project-related web resources, not with focused crawling, but by using the search engine (SE) APIs directly. We aggregate the results and creat...
     
Agreeing to disagree: search engines and their public interfaces
Found in: Proceedings of the 2007 conference on Digital libraries (JCDL '07)
By Frank McCown, Michael L. Nelson
Issue Date:June 2007
pp. 309-318
Google, Yahoo and MSN all provide both web user interfaces (WUIs) and application programming interfaces (APIs) to their collections. Whether building collections of resources or studying the search engines themselves, the search engines request that resea...
     
Generating best-effort preservation metadata for web resources at time of dissemination
Found in: Proceedings of the 2007 conference on Digital libraries (JCDL '07)
By Joan A. Smith, Michael L. Nelson
Issue Date:June 2007
pp. 51-52
HTTP and MIME, while sufficient for contemporary webpage access, do not provide enough forensic information to enable the long-term preservation of the resources they describe and transport. But what if the originating web server automatically provided pre...
     
Factors affecting website reconstruction from the web infrastructure
Found in: Proceedings of the 2007 conference on Digital libraries (JCDL '07)
By Frank McCown, Michael L. Nelson, Norou Diawara
Issue Date:June 2007
pp. 39-48
When a website is suddenly lost without a backup, it maybe reconstituted by probing web archives and search engine caches for missing content. In this paper we describe an experiment where we crawled and reconstructed 300 randomly selected websites on a we...
     
Search engines and their public interfaces: which apis are the most synchronized?
Found in: Proceedings of the 16th international conference on World Wide Web (WWW '07)
By Frank McCown, Michael L. Nelson
Issue Date:May 2007
pp. 1197-1198
Researchers of commercial search engines often collect datausing the application programming interface (API) or by"scraping" results from the web user interface (WUI), butanecdotal evidence suggests the interfaces produce differentresults. We provide the f...
     
Evaluation of crawling policies for a web-repository crawler
Found in: Proceedings of the seventeenth conference on Hypertext and hypermedia (HYPERTEXT '06)
By Frank McCown, Michael L. Nelson
Issue Date:August 2006
pp. 157-168
We have developed a web-repository crawler that is used for reconstructing websites when backups are unavailable. Our crawler retrieves web resources from the Internet Archive, Google, Yahoo and MSN. We examine the challenges of crawling web repositories, ...
     
Repository replication using SMTP and NNTP
Found in: Proceedings of the 2006 national conference on Digital government research (dg.o '06)
By Joan A. Smith, Martin Klein, Michael L. Nelson
Issue Date:May 2006
pp. 436-437
We describe our progress on NSF ISS 0455997, "Shared Infrastructure Preservation Models". The focus of our efforts is to evaluate different preservation models based on Internet infrastructure that sites already have. Specifically, we investigate replicati...
     
Distributed, real-time computation of community preferences
Found in: Proceedings of the sixteenth ACM conference on Hypertext and hypermedia (HYPERTEXT '05)
By Johan Bollen, Michael L. Nelson, Thomas Lutkenhouse
Issue Date:September 2005
pp. 88-97
We describe the integration of smart digital objects with Hebbian learning to create a distributed, real-time, scalable approach to adapting to a community's preferences. We designed an experiment using popular music as the subject matter. Each digital obj...
     
 1  2 Next >>