Data analysis project
$250-750 USD
착불
I need data analysis project by python.
Let's assume that we would like to collect labels for each column as Organization, Person, Address and Other from 250,000 different datasets. For instance, different column names such as vendor_name, business, name, corporation and parent_company can be used to represent Organization and it becomes difficult to label each column manually when you have a large number of datasets. Explain your ideas and methods to efficiently obtain labels in as much detail as possible. After that I will award you.
프로젝트 ID: #21229973
프로젝트 소개
이 일자리에 대한 프리랜서 26 명의 평균 입찰가: $523
Hello, I have gone through your job posting and become very much interested to work with you. I am an expert in this field. I have already completed several projects like this. For evidence you can see my profile. Pl 기타
I am a Data Scientist with 3+ years of experience in Data Analysis, Statistical Modelling, Machine Learning, Deep Learning, Computer Vision and Natural Language Processing. I have worked across various domains such as 기타
Dear sir. Your project attracted my attention at first glance, because I've extensive experience in Data Analysis Programming. I'm really confident about your project, and very eager to join your project. If we have a 기타
HI, I am data scientist and have good experience in python and R programming. My area of interest is statistical Analysis of dataset and apply ML/deep learning algorithm. I can intern your tasks. Kind Regards
Hi, I can help you get this done. I have skills in Python, Data Processing, Machine Learning (ML), Data Mining, Statistical Analysis
Hi, First of all, your explanation was not very clear to me. Do we need to categorize the column label or something else? I am not able to understand your explanation completely. Please share more detail as I here 기타
Hello sir. I'm excited about your project, because I've really rich experience in Data Analysis Programming. I've developed many projects similar to yours and excellent skills. If you award me, I'll provide wonderfu 기타
hi, I'm a professional statistical analyst seeking opportunity to provide highest quality services in the following areas of Statistics and Econometric. Looking for outstanding opportunities to apply my academic creden 기타
Hello, I've read your project requirements thoroughly. The most possible solution for your problem would be to collect all the keywords (column names) through a program then filter them out for unique values. After th 기타
It is a job access can do it. Just simple filtration of Variable, if your data is stored. I can do your project efficiently
Hello, For what I have understand is you need to automate the process of labeling the columns from around 250000 datasets into the finite columns like organization , etc. I would propose to create a dictionary for ever 기타
My approach would be to: 1. Collect all column names from the different data sources 2. Apply basic text processing steps and regular expression methods like removal of special characters and stopwords, treatment of st 기타
My preferred method of freelancing is an interactive approach to project solving. I have an MSEE specializing in Digital Signal/Image/RF Processing. I do my work in MATLAB (expert). I also do Python programming.
Data cleaning & extraction can be done various pattern matching and regrex based on dataset given. Custom algorithm to get more effiecient extraction based on given input All models will be coded in python, so all ma 기타
That's Simple , NER - Named Entity Recognition with SpaCy or NLTK , will do your task . Process will involve from reading column names to categorizing - with tokenizing , chunking , etc . to your datasets , actually wh 기타
0/ Make data mining based on your datasets 1/ Create a list of words from all datasets using bag of words 2/ display those words on the screen 3/ use the list of stop words and create rules that will be used for separ 기타