Expand on Java project using persistent data structures and additional similarity metrics.
$30-250 USD
종료됨
게시됨 2년 이상 전
$30-250 USD
제출할때 지불됩니다
It requires two programs. and create an order-32 B-tree holding word frequencies for each of at least 100 sites. Use the file system to map sites to datafiles. Use a buffer cache to speed up IO. Also, Pre-categorize (and somehow store) records into 5 to 10 clusters using k-means, k-mediods, or a similar metric. This would need to extend the previous order to display a category(cluster) and most similar key from the above data structures.
The existing program is a web page categorization program. The program reads 10 (or more) web pages. The urls for these web pages can be maintained in a control file that is read when the program starts. For each page, the program maintains frequencies of words. The user can enter any other URL, and the program reports which other known page is most closely related, using a similarity metric. It uses existing library collections for all data structures, except for a custom hash table class which is maintaining word frequencies. It establishes a similarity metric. Which is part based on word-frequencies. It uses jsoup as a parser to extract words or other components. And has a gui, made with javafx.
Thursday, November 4, 2021
11:55 PM
Hi, client.
I am Highly interested in your job because I am living on python having 7+ years experiences.
Last year I have developed several python projects including e-commerence, devOps app and etc.
I can show you those to prove my ability.
please send me a message for more discussion.
Thanks. doan.