Overview
What is TextData?
This page is under construction! Please check back later.
Background and Motivation
The TextData project (formerly SeekNet, The Community Digital Library) began in the fall of 2022. Its original purpose was to support an information retrieval and ranking course assignment at the University of Illinois Urbana-Champaign. The initial development was done by Kevin Ros, the TA at the time, who created a basic version of TextData for use in the course. As the semester progressed, Kevin and his PhD advisor and course instructor, Dr. ChengXiang Zhai, realized that TextData could be generalized into a much more powerful online platform: one that could support numerous features, act as a sandbox for retrieval and recommendation research, and allow the collection of data sets for studying user interactions. Since the fall of 2022, Kevin has led a team of developers from the University of Illinois, under the supervision of Dr. Zhai, to build TextData into what you see here today. TextData has also become a central component of Kevin's PhD thesis.
Purpose
The purpose of TextData depends on you and your goals:
- As a user, ...
- As a developer, ...
- As a researcher, ...
Research
TextData builds on, and has led to, several publications and posters:
- Ros, Kevin, Maxwell Jong, Chak Ho Chan, and ChengXiang Zhai. "Generation of Student Questions for Inquiry-based Learning." In Proceedings of the 15th International Conference on Natural Language Generation, pp. 186-195. 2022.
- Ros, Kevin, Matthew Jin, Jacob Levine, and ChengXiang Zhai. "Retrieving Webpages Using Online Discussions." In Proceedings of the 2023 ACM SIGIR International Conference on Theory of Information Retrieval, pp. 159-168. 2023.
- Ros, Kevin, and ChengXiang Zhai. "The CDL: An Online Platform for Creating Community-based Digital Libraries." In Computer Supported Cooperative Work and Social Computing, pp. 372-375. 2023.
- Ros, Kevin, and ChengXiang Zhai. "A Task-Focused View of the Community Digital Library." Presented at Task Focused IR in the Era of Generative AI, Microsoft Research, Redmond, Washington, September 28-29, 2023.
Our current and future research projects span several directions, including contextual search, automatic content organization, content visualization, chatbot integration, and user studies. We are always looking for collaborators; if you would like to get involved, please reach out to Kevin Ros at [email protected].
Setup
Creating an Account
You must create an account before you can begin using TextData. An account can be created here. Note that accounts created on the website will not work when running the service locally through localhost, so you will need to create a separate account on your local instance.
Installing the Extension
The Chrome extension is available in the Chrome Web Store and can be installed from here. After installing it, you will be able to log in to the account you created on the TextData website. The extension defaults to logging in users via the hosted version of TextData, so if you wish to use the extension with your local instance, you will need to change the extension's "Backend Source" setting.
Collaborations
Development
The source code for TextData is available on GitHub. If you notice a bug or would like to add a feature, then we welcome pull requests. If you have an idea for a larger feature or structural change, then please reach out to Kevin Ros at [email protected].
Research
We are always looking for collaborators across research disciplines. Whether it be information retrieval, recommendation, user studies, HCI, or any other area, please reach out to Kevin Ros at [email protected].