
Data scientist needed

$10-30 AUD

Closed
Posted almost 2 years ago

Paid on delivery
In this project, you will develop an Oozie workflow to process and analyze a large volume of flight data.

• Instructions:
1. Form a project team of four students (including yourself).
2. Install Hadoop/Oozie on your AWS VMs.
3. Download the Airline On-time Performance data set (flight data set) from the period of October 1987 to April 2008 on the following website: [login to view URL]:10.7910/DVN/HG7NV7
4. Design, implement and run an Oozie workflow to find out
   a. the 3 airlines with the highest and lowest probability, respectively, of being on schedule;
   b. the 3 airports with the longest and shortest average taxi time per flight (both in and out), respectively; and
   c. the most common reason for flight cancellations.

• Requirements:
1. Your workflow must contain at least three MapReduce jobs that run in fully distributed mode.
2. Run your workflow to analyze the entire data set (total 22 years from 1987 to 2008) at one time on two VMs first and then gradually increase the system scale to the maximum allowed number of VMs for at least 5 increment steps, and measure each corresponding workflow execution time.
3. Run your workflow to analyze the data in a progressive manner with an increment of 1 year, i.e. the first year (1987), the first 2 years (1987-1988), the first 3 years (1987-1989), …, and the total of 22 years (1987-2008), on the maximum allowed number of VMs, and measure each corresponding workflow execution time.

• Submission (all in a zipped file: [login to view URL]):
1. A [login to view URL] text file that lists all the commands you used to run your code and produce the required results in a fully distributed mode
2. An [login to view URL] text file that stores the final results from all the runs
3. The source code of your MapReduce programs (including the JAR files) and any other programs you might have developed and included in the workflow
4. The Oozie workflow XML file
5. A project report in PDF that includes:
   a. A diagram that shows the structure of your Oozie workflow
   b. A detailed description of the algorithm you designed to solve each of the problems
   c. A performance measurement plot that compares the workflow execution time in response to an increasing number of VMs used for processing the entire data set (22 years) and an in-depth discussion on the observed performance comparison results
   d. A performance measurement plot that compares the workflow execution time in response to an increasing data size (from 1 year to 22 years) and an in-depth discussion on the observed performance comparison results
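The three analyses in instruction 4 each reduce to a group-and-aggregate step, which is what a MapReduce job would do per key. The following is only a single-process Python sketch of that logic, not the Hadoop/Oozie implementation the post asks for. Several things in it are assumptions rather than facts from the post: the column names (`UniqueCarrier`, `Origin`, `Dest`, `ArrDelay`, `TaxiIn`, `TaxiOut`, `Cancelled`, `CancellationCode`), the use of `NA` for missing values, the in-memory `SAMPLE` rows (invented purely for illustration), and the definition of "on schedule" as an arrival delay of at most `ON_TIME_THRESHOLD_MIN` minutes.

```python
import csv
import io
from collections import Counter, defaultdict

# Assumption: a flight counts as "on schedule" if it arrives at most this late.
ON_TIME_THRESHOLD_MIN = 10

# Hypothetical sample rows; column names are assumed, not taken from the post.
SAMPLE = """\
UniqueCarrier,Origin,Dest,ArrDelay,TaxiIn,TaxiOut,Cancelled,CancellationCode
AA,JFK,LAX,5,6,20,0,
AA,LAX,JFK,30,10,14,0,
UA,JFK,ORD,-3,4,22,0,
UA,ORD,JFK,NA,NA,NA,1,B
UA,ORD,LAX,NA,NA,NA,1,B
AA,JFK,ORD,NA,NA,NA,1,A
"""


def parse_int(value):
    """Return an int, or None for missing or non-numeric fields such as 'NA'."""
    try:
        return int(value)
    except (TypeError, ValueError):
        return None


def on_time_probability(rows):
    """Per carrier: on-time flights divided by flights with a known delay."""
    flights, on_time = Counter(), Counter()
    for row in rows:
        delay = parse_int(row.get("ArrDelay"))
        if delay is None:
            continue  # cancelled or unrecorded flights carry no delay value
        flights[row["UniqueCarrier"]] += 1
        if delay <= ON_TIME_THRESHOLD_MIN:
            on_time[row["UniqueCarrier"]] += 1
    return {carrier: on_time[carrier] / flights[carrier] for carrier in flights}


def average_taxi_time(rows):
    """Per airport: mean of taxi-out at the origin and taxi-in at the destination."""
    totals, counts = defaultdict(int), Counter()
    for row in rows:
        for airport_col, taxi_col in (("Origin", "TaxiOut"), ("Dest", "TaxiIn")):
            minutes = parse_int(row.get(taxi_col))
            if minutes is None:
                continue
            totals[row[airport_col]] += minutes
            counts[row[airport_col]] += 1
    return {airport: totals[airport] / counts[airport] for airport in counts}


def cancellation_counts(rows):
    """Count cancellation codes over cancelled flights with a recorded code."""
    reasons = Counter()
    for row in rows:
        code = row.get("CancellationCode") or ""
        if row.get("Cancelled") == "1" and code not in ("", "NA"):
            reasons[code] += 1
    return reasons


def lowest_and_highest(scores, n=3):
    """Return (n lowest, n highest) items of a {key: score} mapping."""
    ranked = sorted(scores.items(), key=lambda item: item[1])
    return ranked[:n], ranked[-n:][::-1]


if __name__ == "__main__":
    rows = list(csv.DictReader(io.StringIO(SAMPLE)))
    print(lowest_and_highest(on_time_probability(rows)))
    print(lowest_and_highest(average_taxi_time(rows)))
    print(cancellation_counts(rows).most_common(1))
```

In a MapReduce version, the body of each `for row in rows` loop would roughly correspond to a mapper emitting key/value pairs, and the final division or count to a reducer. The Oozie workflow would then chain the three jobs.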
Project ID: 33639718

About the project

4 proposals
Remote project
Active 2 years ago

The 4 freelancers bidding on this project have an average bid of $33 AUD.
I hold a PhD in Machine Learning and have 12 years of experience developing and deploying statistical/ML solutions for various organisations and institutions using Python. I have deep expertise in ML modelling (the techniques cited in your job post), and I can complete your assignment quickly. I am the best person for the job. Please refer to my previous work on Freelancer, and search for me on Google Scholar to learn more about my ML modelling expertise. I will respond to your calls at any time and will implement your request as quickly and accurately as possible. Currently, I work as a Lead Data Scientist at a top-tier US MNC, where this type of modelling is part of my daily work.
$30 AUD in 2 days
5.0 (8 reviews)
3.9
Hi there, I read your post and I would really like to work with you, if possible. I work as an AI engineer; you can check my profile for proof. I have worked on many ML/DL projects before and can show them to you if you like. Your project is easy for me, so I can complete it in a short time with 100% quality. Which code format would you like to receive: a Jupyter notebook or a .py file? Also, can you tell me the exact time frame? I GUARANTEE MY WORK. If you don't like my work, I will redo it as you want, free of charge. I will wait for your response. Best regards
$50 AUD in 7 days
4.9 (3 reviews)
2.7
As a BI developer, I have 3+ years of experience at an MNC. I have worked with visualization tools such as Tableau, Cognos, Power BI, and Spotfire; ETL tools such as Informatica and Talend; and databases such as SQL, PL/SQL, Aurora, Oracle, and MS Access. I also worked as a backend web developer for more than 3 years during my schooling, on many projects in PHP, Django, Node.js, Java, and .NET.
$30 AUD in 2 days
0.0 (0 reviews)
0.0
Hi, I am Shivam. I have been working in the data analytics industry for the past 2 years and have good hands-on experience with analytics technologies such as Python, Spark, Hive, SQL, and AWS. I am interested in helping you with your task. Let me know some more details about this project. Thanks
$20 AUD in 7 days
0.0 (0 reviews)
0.0

About the client

Cairo, Egypt
4.9
39
Payment method verified
Member since October 25, 2018

Client verification

Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)