Field of Science

Showing posts with label writing. Show all posts
Showing posts with label writing. Show all posts

Results: Replication in Psychology


My paper with Adena Schachner on replication in psychology is now published. The paper contains 3 main sections: a reasonably thorough literature review on replication rates in psychology, a proposal as to how to improve replication rates (primarily through tracking replication rates), and the results of a survey of psychologists on replication practices (many thanks to all who participated). The results of the survey was that while not nearly enough replications are attempted, there are actually more being attempted than we had guessed (or than many of our colleagues that we discussed this project with had guessed).
This paper is part of a larger collection of papers on reimagining the publication and review process, and is more of those papers are printed, I plan to discuss at least some of them.

DragonDictate

 I have been doing a great deal of writing lately, though obviously not here. I thought that perhaps at some point in graduate school, I should try getting some of the projects I have done published, and I thought that time was now. Since this requires writing them up, I have been writing. I have gotten a lot of writing done, but I noticed that this came with an increased number of hours spent sitting at my computer. Knowing enough  friends who have suffered from repetitive stress injuries, I decided I should take a proactive approach to ergonomics.

One outcome of this process is was that I purchased voice-recognition software, namely Dragon Dictate. This actually complements my preference to pace while I think. My writing style involves a lot of thinking, punctuated by occasional bursts of typing. So being able to write as I pace seemed like a good idea.

I cannot say that this experiment has been an overwhelming success. Based on what I have learned from the documentation, Dragon Dictate seems to place a great deal of faith in transitional probabilities. That is,  the hypotheses it makes about what you are saying are based not only on the sounds that you make but based on what words typically come after one another.

Of course, what words typically follow one another depends a great deal on what you are talking about. I suspect that Dragon Dictate was not trained on a corpus involving a great deal of psycholinguistics papers, but in fact it is psycholinguistics papers that I am writing. Dragon Dictate makes a number of very systematic and very annoying errors. For instance, it is absolutely convinced that, no matter how carefully I say the word “verb”, I could not possibly have meant to say that word, and probably meant "four herbs" or some such. In the general case, this is probably the right conclusion. The word “verb” is so  rarely  spoken, that it is probably a good bet that it even if you think you  heard the word “verb”, what was actually spoken was probably something else. However, since almost all my papers are about verbs, I use that word so often that probably the right hypothesis is that no matter what you think you heard, the word I actually uttered was “verb”.

Needless to say, it doesn't do very well with technical terms from semantic and syntactic theory, either.

 The upshot is that I spend so much time correcting DragonDictate's mistakes, that it is not clear that I wouldn't be better off just typing the document begin with (you can correct using voice commands, but it is so cumbersome that I usually type instead). Dragon Dictate has a function where you can feed it various documents. The documentation appears to imply that it can learn the relevant word frequencies and transitional probabilities from these documents. I have been feeding at papers I have written, in the hopes that this will help out. So far there has been limited improvement, but I am not sure just how large a corpus of needs. I will keep you updated.

(Written using DragonDictate plus hand correction.)

What are the best cognitive science blogs?

If you look to your right, you'll see I've been doing some long-needed maintenance to my blog roll. As before, I'm limiting it to blogs that I actually read (though not all the blogs I read), and I have it organized by subject matter. As I did this, I noticed that the selection of cognitive science and language blogs is rather paltry. Most of the science blogs I read -- including many not included in the blog rolls -- are written by physical scientists.

Sure there are more of them than us, but even so it seems there should be more good cognitive science and language blogs. So I'm going to crowd-source this and ask you, dear readers, who should I be reading that I'm not?

Invented Languages

Those who haven't already seen it might be interested in my article about the role of invented languages in science at ScientificAmerican.com.

Building a Better Spell-Checker


LinkToday Slate carried an interesting piece about spell-checker technology by Chris Wilson. A spell-checker typically works in the obvious way: a word you type in is compared to a dictionary. The question is where the dictionary comes from. If you use a lot of proper nouns -- or, in my case, a lot of technical jargon -- you risk the red-squiggly wrath of Microsoft Word.

It's been clear to me for a while that search engines work from much larger lexicons than do word processors. The article fills in some detail as to how they do this (not surprisingly, it involves some of the sophisticated statistics that has become so important in computer approaches to language). Read the article here.


(image borrowed from eduscapes.com)

Does literacy still matter?

In an intriguing recent article in Science Magazine (subscription required), Douglas Oard of the University of Maryland asks what the cultural consequences of better speech recognition software will be.

He notes that the reason literacy is so important is the "emphemeral nature of speech." Even after audio recording became cheap, print was still necessary because it is easier to store and easier to skim and search.

However, new technology is rapidly shifting the balance, as hardware space becomes cheap and computerized searching of audio material becomes effective. Perhaps the costs of learning to read will soon cease to be justified.

Really?

Oard recognizes there will be resistance to the idea (note that he doesn't actually endorse eliminating reading and writing), but he cautions that we should think with our heads, not our biases:

Our parents complained that our generation relied on calculators rather than learning arithmetic... In Plato's Phaedrus, the Pharaoh Thamus says of writing, "If men learn this, it will implant forgetfulness in their souls: They will cease to exercise memory because they rely on that which is written." ... Our generation will unlock the full potential of the spoken word, but it may fall to our children, and to their children, to learn how best to use that gift.


---------
D. W. Oard (2008). SOCIAL SCIENCE: Unlocking the Potential of the Spoken Word Science, 321 (5897), 1787-1788 DOI: 10.1126/science.1157353