frequency counter in text using struct

완료 등록 시간: Sep 24, 2015 착불
완료 착불

A Frequency Counter

Our word frequency counter allows you to count the frequency usage of each word in your text.

That MS Word add-on created a list of all the words in a document, ordered by frequency. It made it easy to

detect overuse and/or abuse of a certain word or expression. The little used words were also of help, because it

may find errors that the spelling checker does not detect.

Automated authorship detection is the process of using a computer program to analyze a large collection of

texts, one of which has an unknown author, and making guesses about the author of that unattributed text. The

basic idea is to use different statistics from the text -- called "features" in the machine learning community -- to

form a linguistic "signature" for each text. One example of a simple feature is Type-Token Ratio - the number

of different words used in a text divided by the total number of words. It's a measure of how repetitive the

vocabulary is.

The documents to be checked is just regular text files (i.e., sequences of characters) and it has one word per line

--- you do not have to extract a word. It means this project does not depend on previous lab projects. The

frequency counter reads in the file to get words from the file. Each word in the file is counted and sorted

properly to produce output:

 Word list in frequency order -- high to low (first 10 highest and last 10 lowest)

 Word list in alphabetical order -- high to low (first 10 in alphabetical order and last 10 in alphabetical

order)

 Type-Token Ratio

input:

frequency.txt.

output:

 Word list in frequency order -- high to low (first 10 highest and last 10 lowest)

 Word list in alphabetical order -- high to low (first 10 in alphabetical order and last 10 in alphabetical

order)

 Type-Token Ratio

C++ 프로그래밍

프로젝트 ID: #8549076

프로젝트 소개

5 건(제안서) 재택 근무형 프로젝트 서비스 이용 중: Sep 24, 2015

수상자:

vano101

Hello, This is easy assignment, it can be completed in about 1-2 hrs .........................................

$10 USD (1일 이내)
(568건의 리뷰)
6.7

이 일자리에 대한 프리랜서 5 명의 평균 입찰가: $18

it2051229

Hi, I have read the requirements and I would like to do this project if given the opportunity. I am a computer science practitioner and having the "C++ and Algorithms Certification Badge" in my profile gives me confide 기타

$25 USD (1일 이내)
(303 리뷰)
6.3
urmate

A proposal has not yet been provided

$20 USD (0일 이내)
(84 리뷰)
5.4
ithuang2014

A proposal has not yet been provided

$24 USD (1일 이내)
(27 리뷰)
4.1
rinabuoy13

A proposal has not yet been provided

$25 USD (1일 이내)
(7 리뷰)
2.9
ajaynishad

A proposal has not yet been provided

$10 USD (1일 이내)
(0 리뷰)
0.0