Archive for February, 2011

N-grams, Press Releases and the SEC

February 25, 2011

I tried a few different datasets to generate the text. The first was a set of press releases. The second was from filings made by companies with the Securities and Exchange Commission.

Links: press-releases, text-release-generated, 10q_mda, generated
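
For readers who have not seen n-gram text generation before, here is a minimal sketch of the general idea in Python. It is not the code used for this post: the function names `build_model` and `generate`, the word-level bigram setting, and the one-line sample corpus are all assumptions made for illustration.

```python
import random
from collections import defaultdict

def build_model(text, n=2):
    """Map each (n-1)-word context to the list of words seen right after it."""
    if n < 2:
        raise ValueError("n must be at least 2")
    words = text.split()
    model = defaultdict(list)
    for i in range(len(words) - n + 1):
        context = tuple(words[i:i + n - 1])
        model[context].append(words[i + n - 1])
    return model

def generate(model, seed, length=20, rng=random):
    """Extend `seed` (a tuple of n-1 words) by sampling observed followers."""
    out = list(seed)
    k = len(seed)
    for _ in range(length):
        followers = model.get(tuple(out[-k:]))
        if not followers:
            break  # this context never appeared with a following word
        out.append(rng.choice(followers))
    return " ".join(out)

# Hypothetical sample corpus, standing in for a real set of press releases.
sample = "the company reported revenue and the company reported growth"
model = build_model(sample, n=2)
print(generate(model, ("the",), length=20, rng=random.Random(0)))
```

Because followers are stored with repeats, a word that follows a context more often is picked more often. Swapping the corpus, say from press releases to SEC filings, changes the style of the output without changing the code.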

Posted in Learning_Bit_by_Bit

Stop List – Stop Tokenizer – Google Patent

February 18, 2011

“Stopwords” are those words (and potentially phrases) that search engines and search parsers filter out from the query. In my own experience, we frequently refer to them as “noise”. Typical examples include “a”, “an” and “the”. Most electronic content management systems (“ECMs”) which store large quantities of text data will remove these from the full text [...]
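
As a rough illustration of the filtering described above, here is a minimal Python sketch. The three-word stop list comes from the examples in the text; the function name `remove_stopwords` and the simple regex tokenizer are assumptions for illustration, not how any particular search engine or ECM does it.

```python
import re

# Illustrative stop list only; production systems use longer, tuned lists.
STOPWORDS = {"a", "an", "the"}

def remove_stopwords(query, stopwords=STOPWORDS):
    """Lowercase the query, split it into word tokens, and drop stopwords."""
    tokens = re.findall(r"[a-z0-9']+", query.lower())
    return [t for t in tokens if t not in stopwords]

print(remove_stopwords("The history of an idea"))
# prints ['history', 'of', 'idea']
```

Note that “of” survives here because it is not in the toy list. Deciding which words count as noise is the real design choice, and it depends on the collection being searched.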

Posted in Learning_Bit_by_Bit

Crystal Quest Processing Clone – v.000000001

February 16, 2011

My first game –> http://itp.nyu.edu/~db1060/cgd/class04/index.html

Posted in Digital Game Design

Chatterbot – The Interview

February 11, 2011

I’m still playing with this, and getting the code to work as an applet embedded in a WordPress page is proving more complex than I thought. But it is working and compiling, and it was a total hoot to play with. For those, like me, wrestling with RegEx, there are several extremely good websites on it [...]

Posted in Learning_Bit_by_Bit

Turing vs. Searle

February 3, 2011

I’m a little embarrassed to admit that I originally read Searle’s piece more than 25 years ago. I’ve diligently re-read it, together with re-reading Turing’s piece. And, in truth, I still think Searle is hiding the ball when it comes to the core issue. In short, Searle makes completely valid arguments through about 70% of [...]

Posted in Learning_Bit_by_Bit