To enable and control access to data on a system database via an API.
Say for instance, an application wants access to credit card data for a particular user. A table has 50 columns and the application doesn’t need the data in all 50 columns, only 3. The application may want to access a user's ssn, phone, monthly amount spent. The API should expose data from these 3 entities (not concerned about the other 47 entities). Provide an endpoint for the user by extracting the data and outputting in a text file or on a browser in JSON. This API enables and controls access to the data. In the future, if there is an application that requires data from only these 3 entities (SSN, phone, monthly amount) this API can be used again.
What the Deliverable Is:
Developer must utilize an API enablement skeleton (already developed) that exposes Hive data entities and convert into an API that exposes Hbase data entities by linking to Spark, creating pyspark data frame, converting to pandas data frame, transfer to Flask REST API. Once the data is extracted it should be outputted at the endpoint as a text file or via a browser in JSON. Code must run in my environment using Python 2.7 and Spark 1.6 versions. Seller must provide extensive comments within the scripts and help me setup the environment if need be.
Who Should Apply for the Job:
If you're an expert in Python, Pyspark, Spark, Hbase, understand how to link Hbase and Spark and you're very familiar with data extraction/data mining then this is the job for you!
Python, Hbase, Spark, Pyspark, Pandas, Flask, JSON
*NOTE* Please review all attached files to understand what's happening and how the code works before applying for the job
43 фрилансеров(-а) в среднем готовы выполнить эту работу за $36/час
Greetings! I am an Expert Python, Hbase, Spark, Pyspark, Pandas, Flask, JSON Developer, You can review my most recent works here: [login to view URL] [login to view URL] Regards, Ashwani