Greetings,
We are a software development company based in Ireland, our focus is primarily on Java. We recommend a VPS to act as a host and provide a gateway IP for the REST protocols implemented. A Apache HTTP web server will enable requests to be forwarded to an Apache Tomcat server which is used to handle Java cloud functionality, such as a web page to control the crawling server.
Once a request has been forwarded to the Apache Tomcat server, we can use Java to communicate with Apache Nutch, Apache Tika or elasticsearch. How we communicate from this point depends on how the web crawler is to be implemented.
Another suggestion we would make is to change MongoDB to MySQL, this is because MongoDB has been shown to have poor performance on virtual private servers.
Here is a short list of the services we would provide and the time frames in which we estimate.
CentOS 7 VPS install. (Week 1)
Apache HTTP web server configuration. (Week 1)
Apache Tomcat cloud server. (Week 1-2)
Apache Nutch java development (Week 2-3)
Apache Tika java development. (Week 3)
Elasticsearch java development. (Week 3-4)
Quality Assurance testing. (Week 4)
Documentation. (Week 4-5)
Our budget cost is calculated via milestones. Which are provided below. We require a deposit before work can begin, this is because freelancer takes 10% of the total budget when the project is awarded. Any further questions, please don't hesitate to ask!
Sincerely,
Owen McMonagle.
Software Eureka.