Text Visualizer

Dave Boyhan - April 15, 2011 / March 22, 2011

Learning Bit-by-Bit - Final - Proposal
Dynamic Web Design - Midterm

Text visualization is a process whereby text, whether an article, a book, a poem, a press release or the Constitution, is passed through an analytic algorithm that identifies parts of speech, stop words, most common phrases, unusual phrases, phonetic structure, etc. and then renders that data into a visual display. Examples are under links. The rendering techniques can vary significantly, from a simple "most common word" cloud, to images created by changing the path of a line as the part of the speech in the sentence changes. Similarly, abstract geometric shapes can be created by using the metrics derived from the document. Some examples follow.

Metrics that can be drawn from LingPipe and NLTK include: tokens, unique phrases, parts of speech, text length, etc.

Character Flower
Document Arc
Sentence_Crawl
Sentence Drawings
Literary Organism
This site allows users to:

  1. Upload text samples to a database be visualized using a variety of algorithms
  2. Review the text samples that have been uploaded
  3. Render the text into a variety of different visualizations and save those rendering to a database
  4. Review renderings that others have prepared


site layout

The following is the MySQL table format for storing the texts to upload. It stores names and paths.
Text Visualizer Text Table Format


The following is the MySQL table format for storing the visualizations that have been rendered. It stores names and paths. Text Visualizer Images Table Format