Project 4: LLM Augmented Optional Tasks

If you’re reading this, we’re assuming you’ve already implemented the Hypohist button. For this final part of the project, we encourage you to use LLMs to try to build something even bigger.

Important: Do not use LLM-generated code for any portion of 4c and 4d. LLM-generated code is only allowed for your own chosen “above and beyond” features!

Some possibilities:

  • Add buttons that use one or both datasets in some creative way. For example, you might plot the average length of all words in a given year. Or you might create a visualization of all of a word’s hyponyms. Or you might have a feature that prints the shortest path between two words.
  • The hyponyms search finds all hyponyms, no matter how distant from the source. For example, there are a huge number of hyponyms of “dog”. Add a new field d to the front end, so that the search returns only words that are at a distance of d or less from the given words.
  • Add a ! operator, e.g. if someone enters “!person, leader”, your code will find all hyponyms of “leader” that are not hyponyms of “person”.
  • Explore statistical properties of the dataset. For example, by plotting counts (on the y-axis) vs. ranks (on the x-axis), ideally on log-log axes, you should be able to see Zipf’s law in action.
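
To make the distance-limited search and the ! operator concrete, here is a minimal sketch in Python (not the project's language, just a compact way to show the idea). Everything in it is an assumption for illustration: the graph is a plain dictionary mapping each word to its direct hyponyms, a word counts as its own hyponym at distance 0, and the names hyponyms_within, query, and TOY_GRAPH are hypothetical. Your real graph is built from the synset and hyponym files, so the details will differ.

```python
from collections import deque


def hyponyms_within(graph, word, d=None):
    """Return word plus every hyponym reachable in at most d edges.

    graph maps a word to a list of its direct hyponyms (assumed format).
    If d is None, the search is unbounded.
    """
    seen = {word}
    queue = deque([(word, 0)])
    while queue:
        current, dist = queue.popleft()
        if d is not None and dist == d:
            continue  # at the distance limit: keep the word, do not expand it
        for child in graph.get(current, ()):
            if child not in seen:
                seen.add(child)
                queue.append((child, dist + 1))
    return seen


def query(graph, terms, d=None):
    """Intersect the hyponym sets of plain terms, then remove the
    hyponyms of every term prefixed with '!'."""
    included = None
    excluded = set()
    for term in terms:
        term = term.strip()
        if term.startswith("!"):
            excluded |= hyponyms_within(graph, term[1:], d)
        else:
            found = hyponyms_within(graph, term, d)
            included = found if included is None else included & found
    return sorted((included or set()) - excluded)


# Made-up toy data, not taken from the real dataset.
TOY_GRAPH = {
    "animal": ["dog", "cat"],
    "dog": ["puppy", "hound"],
    "hound": ["beagle"],
    "leader": ["captain", "guide"],
    "person": ["captain", "adult"],
}

print(sorted(hyponyms_within(TOY_GRAPH, "animal", 1)))
print(query(TOY_GRAPH, ["!person", "leader"]))
```

Because breadth-first search reaches each word first along a shortest path, the distance recorded for a word is its minimum distance from the source, which is what the d cutoff needs.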

If you discover or build something cool, email me (hug@cs.berkeley.edu) or come by office hours to demo it.