Expand on Java project using persistent data structures and additional similarity metrics.

$30-250 USD

종료됨

게시됨

2년 이상 전

$30-250 USD

제출할때 지불됩니다

It requires two programs. and create an order-32 B-tree holding word frequencies for each of at least 100 sites. Use the file system to map sites to datafiles. Use a buffer cache to speed up IO. Also, Pre-categorize (and somehow store) records into 5 to 10 clusters using k-means, k-mediods, or a similar metric. This would need to extend the previous order to display a category(cluster) and most similar key from the above data structures. The existing program is a web page categorization program. The program reads 10 (or more) web pages. The urls for these web pages can be maintained in a control file that is read when the program starts. For each page, the program maintains frequencies of words. The user can enter any other URL, and the program reports which other known page is most closely related, using a similarity metric. It uses existing library collections for all data structures, except for a custom hash table class which is maintaining word frequencies. It establishes a similarity metric. Which is part based on word-frequencies. It uses jsoup as a parser to extract words or other components. And has a gui, made with javafx.

Java

Data Analysis

프로젝트 ID: 32024327

프로젝트 정보

2 제안서

원격근무 프로젝트

활동 중 2년 전

돈을 좀 벌 생각이십니까?

이메일 주소

프리랜서 입찰의 이점

예산 및 기간 설정

작업 결과에 대한 급여 수급

제안의 개요를 자세히 쓰세요

무료로 프로젝트에 신청하고 입찰할 수 있습니다

2 이 프로젝트에 프리랜서들의 평균 입찰은 $200 USD입니다.

@DoanDucAnh

Thursday, November 4, 2021 11:55 PM Hi, client. I am Highly interested in your job because I am living on python having 7+ years experiences. Last year I have developed several python projects including e-commerence, devOps app and etc. I can show you those to prove my ability. please send me a message for more discussion. Thanks. doan.

$200 USD 7일에