I tried a few different datasets to generate the text. The first was a set of press releases. The second was from filings made by companies with the Securities and Exchange Commission. press-releases text-release-generated 10q_mda generated
I tried a few different datasets to generate the text. The first was a set of press releases. The second was from filings made by companies with the Securities and Exchange Commission. press-releases text-release-generated 10q_mda generated
“Stopwords” are those words (and potentially phrases) that search engines and search parsers filter out from the query. In my own experience, we frequently refer to them a “noise”. Typical examples include “a”, “an” and “the”. Most electronic content management systems (“ECM’s”) which store large quantaties of text data will remove these from the full text [...]
My first game –> http://itp.nyu.edu/~db1060/cgd/class04/index.html
I’m still playing with this and getting the code to work as an applet embedded in a WordPress page is proving more complex than I thought. But, it is working and compiling and was a total hoot to play with. For those, like me, wrestling with RegEx, there are several extremely good websites on it [...]
I’m a little embarrassed to admit that I originally read Searle’s piece more than 25 years ago. I’ve diligently re-read it, together with re-reading Turing’s piece. And, in truth, I still think Searle is hiding the ball when it comes to the core issue. In short, Searle makes completely valid arguments through about 70% of [...]