2020-08-25: Seven WS-DL classes offered for Fall 2020
https://xkcd.com/2347/A record seven courses from the Web Science and Digital Libraries (WS-DL) Group will be offered in Fall 2020. Because of Covid-19, most of these classes will either be completely...
View Article2020-08-27: Summer Internship Report — Los Alamos National Laboratory
Considering the epidemic of COVID-19, when everything was uncertain, in early May 2020, I was accepted to Applied Machine Learning (AML) Summer Research Fellowship Program at Los Alamos National...
View Article2020-08-27: A 25 Year Retrospective on D-Lib Magazine
Authors’ note: This document is also available as https://arxiv.org/abs/2008.11680. In this HTML version, the footnotes were converted to either hyperlinks or endnotes. A 25 Year Retrospective on D-Lib...
View Article2020-08-30: Google Translate + Stanford NERC produce comparable results to...
Arabic Named Entity Recognition and ClassificationNamed Entity Recognition and Classification (NERC) is very important for many text processing tasks. However, Arabic NERC research is not popular...
View Article2020-09-01: DNC vs RNC pulses - Quantifying news attention for the DNC & RNC...
Figure 1 (click figure to enlarge): Illustration of the level of attention given to the Democratic National Convention (DNC) story by news organization measured with StoryGraph's longitudinal data. The...
View Article2020-09-04 Student ThinSat Research Summer Camp for Hampton Roads High School...
The Student ThinSat Research Summer (STRS) Camp was held virtually from August 3 – August 14, 2020. The event was sponsored by a Virginia Space Grant Consortium Innovate Program grant with faculty and...
View Article2020-09-09: Theory and Practice of Digital Libraries 2020 (TPDL 2020)...
The 2020 Theory and Practice of Digital Libraries (TPDL 2020) was planned to take place in Lyon, France, but was virtually hosted via Big Blue Button. It was a joint conference with ADBIS 2020 and EDA...
View Article2020-09-14: International Conference on Artificial Intelligence in Medicine...
The 2020 International Conference on Artificial Intelligence in Medicine (AIME 2020), hosted by the University of Minnesota, was held virtually 25-28 August 2020. The first two days were dedicated to...
View Article2020-09-17: IEEE International Conference on Information Reuse and...
blockquote { margin: auto; } The 21st International Conference on Information Reuse and Integration for Data Science (IRI 2020) was held virtually (due to the COVID-19 pandemic) instead of Las Vegas...
View Article2020-09-27: Xin Wei (Computer Science PhD student)
This is Xin Wei. I started working with Dr. Wu in the summer of 2020. I received bachelor's degree in Economics from Shanghai Jiao Tong University, China. I also got a Master's degree from Stony Brook...
View Article2020-09-29: James Ecker (Computer Science PhD Student)
Hello WSDL Blog readers! My name is James (Jim/Jimmy) Ecker and I joined the Web Science and Digital Libraries (WS-DL) research group at Old Dominion University as a Ph.D student in Fall 2019. I...
View Article2020-09-28: My report card to my mother
On July 28, 2020, I defended my PhD dissertation --- Bootstrapping Web Archive Collections From Micro-Collections in Social Media --- a culmination of an 12-year journey that began when I arrived the...
View Article2020-09-28: A PhD is a very long tunnel with a light at the end
My PhD defense committee: From the top left, Dr. M. Nelson (my co-advisor), Dr. M. Weigle (my advisor), Dr. M. Abdous, Dr. S. Jayarathna, Dr. J. Wu, and M. Aturban (myself). This year has been tragic...
View Article2020-08-05: Trip report to WOSP: the 8th Workshop on Mining Scientific...
The International Workshop on Mining Scientific Publications (WOSP) started in 2012. The main theme is to use Natural Language Processing (NLP) and text mining tools to aid knowledge creation and...
View Article2020-11-03: 19 Years of Wayback – Inspiring the collection and replay of the web
The Internet Archive’s Wayback Machine is almost 20 years old. As the Wayback Machine nears its second decade full of operation, I reflected on how my research has been inspired by the work that goes...
View Article2020-11-04: How well is Instagram archived?
Figure 1: Snapshots of Katy Perry’s account page on the three leading social media platforms: Instagram, Facebook, and Twitter.A little bit about InstagramIn 2020, social media is considered as one of...
View Article2020-11-04: New Twitter UI: Replaying Archived Twitter Pages That Never Existed
Figure 1: Multiple Temporal Violations in an archived page with the new Twitter interface. When you visit web archives to go back in time and look at a web page, you naturally expect it to display the...
View Article2020-11-15: Sapien Labs Virtual Symposium on Mental Health Trip Report
The Sapien Labs Virtual Symposium on The Future of Mental Health: Measurement, Treatment and Therapies was held virtually via Adobe Connect on 2-3 November 2020. The symposium consisted of 2 sessions...
View Article2020-11-18: Creating Collection Growth Curves With Archives Unleashed Toolkit...
Figure 1: Creating collection growth curves with a web page text derivativeRecently, I have been learning about Archives Unleashed Toolkit (AUT), Hypercane, and how these tools can be used together....
View Article2020-12-02: Comparing Four OCR Tools on US Patent Figure Label Recognition
The task is to extract labels from US patent figures. Patent figures are different from natural images. They are usually drawings of an object, or diagrams such as circuits. A figure file may contain...
View Article