Finding Unique Words in a Document

Table of Contents

Background

“At the Mountains of Madness” is a fantasy horror novella written by the American author H.P. Lovecraft in 1931. The copyright for the novella has expired and is now in the public domain. We will use this text as input to process and will print a unique list of all the words in the document sorted in alphabetical order.

Assignment Download the Lab12.zip starter file. Use 7zip to unzip the file using ‘Extract Here’. Open the project folder in IntelliJ.

Don't use plagiarized sources. Get Your Custom Essay on
Finding Unique Words in a Document
Just from $13/Page
Order Essay

In the UniqueWords class, you will implement the processDocument() method that will read in the document one line at a time. For each String that is input, the code will break the String (if the String is not null or empty) into individual words (using the supplied tokenize() method). The code will then store the word into an appropriate Collection object (a sub-class of Set) that will store unique words in sorted order. The processDocument() method will return the collection.

Notes:

  • The returned list of words must be unique – no word may appear more than once.
  • The list of words must be sorted in alphabetic order.
  • The selection of the right Collection type will automatically guarantee that the list is both sorted and unique. You will not add any special code beyond choosing the correct Collection type to handle sorting and uniqueness.

Example Output

a

abandoned

abandonment

abdul

abhor

abhorred

abhorrent

. . .many lines of output omitted. . .

you

young

younger

zeal

zenith

zero

ziggurats

Got 6099 words

Homework Writing Bay
Calculator

Calculate the price of your paper

Total price:$26
Our features

We've got everything to become your favourite writing service

Need a better grade?
We've got you covered.

Order your paper