[Dataconomy] BigDataFr recommends: High Performance Big Data Analysis Using NumPy, Numba & Python Asynchronous Programming

Big Data Analysis

BigDataFr recommends: High Performance Big Data Analysis Using NumPy, Numba & Python Asynchronous Programming

[…] At that time, based on this question, my first simple answer was the Python List object. I used the List object in many Data Science projects including Data Pipeline and Extract-Transform-Load (ETL) production system. Then the following questions came to mind: Can I use the List object for data manipulation and analysis of millions or billions of rows? What about if I divide a Data Science project into small tasks and run them asynchronously using the latest Python asyncio library? Based on these questions, I decided to spend some time and find out some practical solutions for Big Data analysis using Pythion Data Ecosystem libraries. To make it simple to understand and find the results quickly, the program will calculate the Arithmetic Mean, Median and Sample Standard Deviation values from a float one dimensional NumPy array. […]

Read more
By Ernest Bonat, Ph.D
Source: dataconomy.com

Laisser un commentaire