• How to run the script
    The script is hosted in our GitHub repository of the Digital Narratives of Covid-19 project. On the main page of the repository, there are two buttons that let you launch a Binder environment: one launches the Frequency Analysis Jupyter notebook, and the other the Sentiment Analysis script. Let’s first launch … Read more
  • Reflections on quantified data: #ScholarStrike in the context of COVID-19
    Although the COVID-19 pandemic created a truly shared global context for the first time in years, it soon began to coexist with the local reality of each country. Twitter, as expected, was no stranger to this, and certain hashtags soon began to appear that account for this “localization” process of the pandemic (for example, in … Read more
  • Access our Twitter Collection
    We are happy to finally launch the interface to download a collection of tweets related to the Covid-19 pandemic. You can choose a date range, an area (Mexico, Argentina, Colombia, Perú, Ecuador, Spain, Miami area), and a language (only for the Miami area, in English and Spanish). The texts are processed by removing accents, punctuation, … Read more
  • Frequency Analysis for South Florida (April – June)
    This post compares the top 30 most frequent words and the top 20 hashtags in our English and Spanish Twitter corpora for South Florida from April 25th to June 25th, 2020. We divided the span into two four-week periods to analyze broad trends and themes in the discourse. For our corpus criteria as well as for … Read more
  • Outbreak Topics: Topic modeling of COVID-19
    In this post, we will present another way to explore our dataset of tweets on Covid-19. We intend to detect emerging topics of interest for our study of the social narratives about the pandemic. For this, we will perform unsupervised machine learning using different Python libraries. In this case, we work with data in Spanish, … Read more
  • What can academic journals tell us about COVID-19 and Education?
    The Covid situation has put new terms into our everyday vocabulary, terms such as pandemic or infodemic. The latter, according to Wiktionary, can be defined as: “Blend of information + epidemic. Noun: infodemic (plural infodemics). (informal) An excessive amount of information concerning a problem such that the solution is made more difficult. (informal) A … Read more
  • Analyzing a Twitter Corpus with Voyant (I)
    The first step in working with data is to get to know your corpus. Our project, for instance, is most concerned with the linguistic and humanistic contexts of the Twitter discourses generated by the Covid-19 pandemic. Some starting “get-to-know-you” questions about our corpus include the trend of daily corpus length, most … Read more
  • How to “hydrate” a TweetSet?
    Twitter public discourse is one of our project’s primary research concerns. Twitter’s rich data has also drawn more and more researchers from various disciplines and fields to explore different aspects of society. This blog post serves as a tutorial on using DocNow Hydrator to “hydrate” tweets. Our project, as we explained, is offering a series … Read more
  • A Twitter Dataset for Digital Narratives
    At the end of April, we started to familiarize ourselves with the Twitter API and to ask how to capture the public conversations happening on this social media network. We quickly understood that we needed to focus on a plan and method for organizing our corpus, accomplishing our objectives, and dividing the different tasks among … Read more
  • COVID-19 and Higher Ed. A Look From the Digital Humanities
    The year 2020 opened with the news of a new disease. In a couple of weeks it became a global pandemic, and we have all been concerned with this topic since then. Higher education is not exempt from it, and in the last few months we have seen how discussions on the pandemic have reached the … Read more
  • Mining Twitter and Covid-19 datasets
    The only topic these days: the coronavirus, Covid-19, the pandemic, SARS, the crisis, disease, the enemy, the survival… We are all in the same global situation, and we are all concerned about the many impacts and consequences that this event is having and will have on our lives. This pandemic can be approached from infinite … Read more
  • Project resources
    Interested in knowing more or collaborating with our project? Our main platform is this WordPress site hosted at the University of Miami, where we will be posting resources on Covid-19 data from a humanistic and linguistic perspective and documenting our work. All our data are stored in our repository on GitHub, providing in the near … Read more
  • Hello, world
    Digital Humanities can help us understand data from a humanistic perspective, and this seems particularly true at this time, when data about Covid-19 seem to be everywhere and are generated in overwhelmingly large volumes. Data in a social, humanistic and human context need to be critically analyzed. Digital Narratives of Covid-19 (DHCOVID) will explore during … Read more