2018-08-30: Excited to Join WS-DL group in ODU!
I am an outlier compared with most computer scientists because I spent 10 years in a field called "Astronomy and Astrophysics". Very few computer scientists followed the same path as me to transfer...
2018-09-02: Sampath Jayarathna (Assistant Professor, Computer Science)
I am really excited to be part of Old Dominion University and the WS-DL group. I joined the faculty at Old Dominion University in 2018. Before that, I was a tenure-track assistant professor for two...
2018-09-03: Let's compare memento damage measures!
It is always nice getting a Google Scholar alert that one of my papers has been cited. In this case, I learned that the paper "Reproducible Web Corpora: Interactive Archiving with Automatic Quality...
2018-09-03: Trip Report for useR! 2018 Conference
This year I was really lucky to get my abstract and poster accepted for the useR! 2018 conference. The useR! conference is an annual worldwide conference for the international community of R users and developers....
2018-10-10: Americans More Open Than Asians to Sharing Personal Information...
Mat Kelly reviews "A Personal Privacy Preserving Framework..." by Song et al. at SIGIR 2018....
2018-10-11: iPRES 2018 Trip Report
September 24th marked the beginning of iPRES 2018, held in Boston, MA, for which both Shawn Jones and I traveled from New Mexico to present our accepted papers: Measuring News Similarity Across Ten...
Some tricks to parse XML files
Recently I was parsing the ACM DL metadata in XML files. I thought parsing XML would be a very straightforward job, given that Python has been around for a long time with sophisticated packages such as...
2018-11-08: Decentralized Web Summit: Shaping the Next Web
In my wallet I have a few ₹500 Indian currency notes that say, "I PROMISE TO PAY THE BEARER THE SUM OF FIVE HUNDRED RUPEES" followed by the signature of the Governor of the Reserve Bank of India....
2018-11-09: Grok Pattern
Grok is a way to match a text line against a regular expression, map specific parts of the line into dedicated fields, and perform actions based on this mapping. Grok patterns are (usually long)...
2018-11-10: Scientific news and reports should cite original papers
I highly encourage all scientific news articles and reports to cite the corresponding papers. ScienceAlert usually does a good job of this. This piece of scientific news from ScienceAlert reports the discovery of two rogue planets....
2018-10-11: More than 7000 retracted abstracts from IEEE. Can we find them...
One publisher, more than 7000 retractions. Science magazine: More than 7000 abstracts are quietly retracted from the IEEE database. Most of these abstracts are from IEEE conferences that took place...
2018-11-12: Google Scholar May Need To Look Into Its Citation Rate
Google Scholar has long been regarded as a digital library containing the most complete collection of scholarly papers and patents. For a digital library, completeness is very important because...
2018-11-15: LANL Internship Report
On May 27 I landed in sunny Santa Fe, New Mexico to start my 6-month internship at Los Alamos National Laboratory (LANL) for the Digital Library Research and Prototyping...
2018-11-30: The Illusion of Multitasking Boosts Performance
Today, I read the article at https://www.psychologicalscience.org/news/releases/the-illusion-of-multitasking-boosts-performance.html. The title is "The Illusion of Multitasking Boosts Performance". At...
2018-11-30: Archives Unleashed: Vancouver Datathon Trip Report
The Archives Unleashed Datathon #Vancouver was a two-day event from November 1 to November 2, 2018, hosted by the Archives Unleashed team in collaboration with Simon Fraser University Library and Key,...
2018-12-03: Acidic Regression of WebSatchel
Mat Kelly reviews WebSatchel, a browser-based personal preservation tool....
2018-12-03: Using Wikipedia to build a corpus, classify text, and more
Wikipedia is an online encyclopedia, available in 301 different languages, and constantly updated by volunteers. Wikipedia is not only an encyclopedia, but it also has been used as an ontology to build...
2018-12-14: CNI Fall 2018 Trip Report
Mat Kelly reports on his recent trip to Washington, DC for the CNI Fall 2018 meeting...
2018-12-14: New Insight to Big Data: Trip to IEEE Big Data 2018
IEEE Big Data 2018 was held at the Westin Seattle Hotel between December 10 and December 13, 2018. More than 1100 people registered. The acceptance rates vary between 13% and 24%, with an...
2018-12-17: CoQA Challenge: Machine Reading Competition Recent Result
CoQA is a dataset containing more than 127,000 questions with answers collected from more than 8000 conversations. Each conversation is about a passage in the form of questions and answers. One example...