Find Jobs
Hire Freelancers

Design data architecture for project integrating various structured datasets

€2-6 EUR / hour

Closed
Posted about 6 years ago

€2-6 EUR / hour

We have a science project that has outgrown its experimental setup and needs a new data architecture to enable further scaling. The task is to design one! CURRENT SETUP WHICH NEEDS TO BE UPGRADED WITH A BETTER DATA ARCHITECTURE We simply download the datasets from the structured data sources - in the native format that they are provided there (XML or .csv, for instance) - and store them locally. Datafiles are then processed by R scripts, whereby one R script can be calling several locally stored datafiles, then processing them and storing the outputs locally again. Different datasets can relate to one another with one or more common keys (identical variables). Datasets range from several hundred to several hundred thousand observations. SPECIFICATION As you see, existing setup is rather primitive, and so it is to be replaced with a new data architecture. You are rather unconstrained in coming up with an optimal solution. However, you will not only need to make a proposal, but also justify your design to us (non-experts in the best practices for database management). The task covers everything from - the server choice: what local hardware or cloud service is appropriate for minimising the cost, yet still attaining full functionality?, to - the type of the database able to efficiently handle datasets that will typically reach up to several GB in size, at most, to - database update strategy that will allow to efficiently update our new database with new datafiles, where the source providers regularly - for instance, monthly - provide a new dump file containing an updated dataset. Likewise there should be an easy way to make an update from the source that provides an API, to - ensuring and enabling full compatibility with and full optimisation for R programming language as a tool of choice to work with the data. It is important that the new data architecture is future-proof: scalable and enabling multi-year projects that rely on the collected data. BIDDING & CONTRACT The job is in both articulating your proposed data architecture and in assisting with migration from the current setup. Initially we request that you provide your: - estimated fixed bid for the entire budget - how many hours will it take you to complete most of the work - your availability in hours per day. The contractor will be selected based on the entire project bid. We understand that the specification will be further collaboratively refined during the project execution. Therefore please calculate your budget in such a way that it would cover most of the task described above (80%). Where we would like to have major extensions/additions, we will create a separate follow-up milestone with a separate budget. Consequently this first project can possibly lead to an open-ended engagement on a milestone-based or hourly-rate retainer basis. We are therefore looking for a person who would be interested in/available for an extended collaboration. In our experience with freelance contracting, we typically receive more qualified bids than we can award. Therefore, please have understanding that only shortlisted applicants will be contacted for the round two of additional questions & answers. Thank you!
Project ID: 16479899

About the project

4 proposals
Remote project
Active 6 yrs ago

Looking to make some money?

Benefits of bidding on Freelancer

Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
4 freelancers are bidding on average €6 EUR/hour for this job
User Avatar
I have a good hands on working with Advanced Excel, R and Python and BI tools and technologies, AI, Big Data. I have quite a good knowledge of DL/ML Algorithm , have also developed Dashboards and Web Application. My area of expertise is building financial models (Stock Markets) , Image Processing and building models for food, healthcare and telecom sector, Classification/Prediction/Clustering, NLP and Chatbots. I understand the project requirement and will deliver the desired product within the time specified. I would like to hear from you. Thanks Shivam
€15 EUR in 40 days
5.0 (45 reviews)
6.1
6.1
User Avatar
A proposal has not yet been provided
€4 EUR in 10 days
0.0 (0 reviews)
0.0
0.0
User Avatar
I have experience in Microsoft Technology like SQL Server, SSIS, SSAS and SSRS. As per your requirement, I can design your Database in SQL Server. I can use SSIS (ETL Tool: Extract, Transform and Load) to load your Output file into this new database. In this manner, You don't need to change your existing process, we will just add another steps to load those output files into this new DB. To run this process on daily/weekly/quarterly basis, We can schedule our SSIS Solution through SQL Agent job. And in order to automate your existing process,We can run your R Script through SSIS Package(Executing R Script through Stored Procedure/Command Line Execution) . So we can have a master package which will call child packages to run R Script to process data files and to load those out put files into different Database tables. FYI, I have knowledge about R Programming also but as beginner only. I will take around 30 days to complete this project. It can get increase/decrease as per DB Designed approved by you.
€4 EUR in 20 days
0.0 (0 reviews)
0.0
0.0
User Avatar
Good job. I went through your description thoroughly. as a elegant R programmer, I'll guarantee your project, entirely and strictly. Thanks. *** why me, I have rich experiences related to R projects.
€2 EUR in 40 days
0.0 (0 reviews)
0.0
0.0

About the client

Flag of GERMANY
Frankfurt, Germany
5.0
9
Payment method verified
Member since Mar 2, 2016

Client Verification

Thanks! We’ve emailed you a link to claim your free credit.
Something went wrong while sending your email. Please try again.
Registered Users Total Jobs Posted
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Loading preview
Permission granted for Geolocation.
Your login session has expired and you have been logged out. Please log in again.