Ask questions about projects relating to: computer science or pure mathematics (such as probability, statistics, geometry, etc...).
Moderators: MelissaB, kgudger, Ray Trent, Moderators
by whitesa » Mon Dec 12, 2011 8:58 am
I am an eighth grader and working on background research for the project idea of building a computer app that identify's author by their writing. I was wondering if you had any recommendations for sources.
-
whitesa
-
- Posts: 2
- Joined: Mon Nov 08, 2010 5:49 pm
- Occupation: Student 7th grade
- Project Question: What are the simularities between binary, decimal, and hexadecimal notation
- Project Due Date: January 2011
- Project Status: I am just starting
by hhemken » Wed Dec 14, 2011 3:31 pm
whitesa,
Have you looked at the ScienceBuddies version?:
http://www.sciencebuddies.org/science-fair-projects/project_ideas/CompSci_p022.shtmlThe trick is to use counts of various things as described there to distinguish between authors. For more ideas, make sure you google something like this:
- Code: Select all
text analysis identifying authors
For a huge sample of texts, try Project Gutenberg:
http://www.gutenberg.org/I would recommend you use the plain text versions. You would have to cut out the extraneous stuff at the beginning and the end of texts.
If you can run your program against many large texts, you may also be able to classify them by rough date of publication, author gender, and who knows what else.
Good luck!
Heinz Hemken
Mentor
Science Buddies Expert Forum
-
hhemken
- Expert
-
- Posts: 229
- Joined: Mon Oct 03, 2005 3:16 pm
Return to Grades 6-8: Math and Computer Science
Who is online
Users browsing this forum: No registered users and 1 guest