Blog

  • Access our Twitter Collection
    We are happy to finally launch the interface to download a collection of tweets related to the Covid-19 pandemic. You can choose a range date, an area (Mexico, Argentina, Colombia, Perú, Ecuador, Spain, Miami area), and language (only for the Miami area, in English and Spanish). https://covid.dh.miami.edu/get/ The texts are processed by removing accents, punctuations, […]
  • Frequency Analysis for South Florida (April – June)
    This post compares the top 30 most frequent words and the top 20 hashtags in our Twitter English and Spanish corpora of South Florida from April 25th to June 25th, 2020. We divided it into 2 four-week periods to analyze broad trends and themes in the discourse. For our corpus criteria as well as for […]
  • Outbreak Topics: Topic modeling of COVID-19
    In this post, we will present another way to explore our dataset of tweets on Covid-19. We intend to detect emerging topics of interest for our study of the social narratives about the pandemic. For this, we will perform unsupervised machine learning using different Python libraries. In this case, we work with data in Spanish, […]
  • What can academic journals tell us about COVID-19 and Education?
    The Covid situation has put new terms into our everyday vocabulary, terms such as pandemic or infodemic. This last one, according to Wiktionary can be defined as: Blend of information +‎ epidemic Noun infodemic (plural infodemics) (informal) An excessive amount of information concerning a problem such that the solution is made more difficult. (informal) A […]
  • Analyzing a Twitter Corpus with Voyant (I)
    The first step of working with data is to get to know your corpus. Our project, for instance, is most concerned with the linguistic and humanistic contexts in the Twitter discourses generated by the Covid-19 pandemic. Some starting “get-to-know-you” questions we are interested in about our corpus include the trend of daily corpus length, most […]
  • How to “hydrate” a TweetSet?
    Twitter public discourse is one of our project’s primary research concerns. Twitter’s rich data has also drawn more and more researchers from various disciplines and fields to explore different aspects of society. This blog post serves as a tutorial of using DocNow Hydrator to “hydrate” tweets. Our project, as we explained, is offering a series […]
  • A Twitter Dataset for Digital Narratives
    At the end of April we started to get familiarized with the Twitter API and asking how to capture the public conversations that are happening in this social media network. We quickly understood we needed to focus on a plan and method for organizing our corpus, accomplishing our objectives, and dividing the different tasks among […]
  • COVID-19 and Higher Ed. A Look From the Digital Humanities
    The 2020 opened with the news of a new disease. In a couple of weeks it became a global pandemic and we have all been concerned with this topic since then. Higher education is not exempt of it and in the last few months, we have seen how discussions on the pandemic have reached the […]
  • Mining Twitter and Covid-19 datasets
    The only topic these days: the coronavirus, Covid-19, the pandemic, SARS, the crisis, disease, the enemy, the survival… We all are under the same global situation and we all are concerned by the many impacts and consequences that this event is having and will have in our lives. This pandemic can be approached from infinite […]
  • Project resources
    Interested in knowing more or collaborating with our project? Our main platform is this WordPress site hosted at the University of Miami where we will be posting resources on Covid-19 data, from a humanistic and linguistic perspective, and documenting our work. All our data are stored in our repository in Github, providing in the near […]
  • Hello, world
    Digital Humanities can help us understand data from a humanistic perspective, and this seems particularly true in this time, in which data about Covid-19 seems to be everywhere and overwhelmingly generated in large volumes. Data in a social, humanistic and human context need to be critically analyzed. Digital Narratives of Covid-19 (DHCOVID) will explore during […]