Name : python310-dask-dataframe
Version : 2024.6.2
Vendor : openSUSE
Release : 2.1
Date : 2024-07-11 11:46:12
Group : Unspecified
Source RPM : python-dask-2024.6.2-2.1.src.rpm
Size : 2.45 MB
Packager : https://bugs_opensuse_org
Summary : Pandas-like DataFrame data structure for dask
Description :
A flexible library for parallel computing in Python.
Dask is composed of two parts:
- Dynamic task scheduling optimized for computation. This is similar to Airflow, Luigi, Celery, or Make, but optimized for interactive computational workloads.
- “Big Data” collections like parallel arrays, dataframes, and lists that extend common interfaces like NumPy, Pandas, or Python iterators to larger-than-memory or distributed environments. These parallel collections run on top of dynamic task schedulers.
This package contains the dask DataFrame class.
A Dask DataFrame is a large parallel dataframe composed of many smaller Pandas dataframes, split along the index. These pandas dataframes may live on disk for larger-than-memory computing on a single machine, or on many different machines in a cluster.
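The "split along the index" idea can be illustrated without Dask itself. The following is a conceptual sketch in plain Python, not Dask's API or implementation: the toy `rows` data, the `divisions` boundaries, and the helpers `split_by_index` and `partition_sum` are all hypothetical names chosen for illustration. Rows sorted by index are cut into partitions at index boundaries, each partition is reduced independently (as separate pandas dataframes could be), and the partial results are then combined.

```python
from bisect import bisect_right

# Hypothetical toy data: (index, value) rows, already sorted by index.
rows = [(1, 10.0), (2, 20.0), (4, 40.0), (5, 50.0), (7, 70.0), (9, 90.0)]

# Assumed index boundaries: partition i holds indexes in
# [divisions[i], divisions[i + 1]); the last partition also includes
# the final boundary value.
divisions = [1, 4, 7, 9]


def split_by_index(rows, divisions):
    """Assign each row to the partition whose index range contains it."""
    n_parts = len(divisions) - 1
    parts = [[] for _ in range(n_parts)]
    for idx, value in rows:
        # bisect_right locates the range; the clamp keeps the final
        # boundary value inside the last partition.
        p = min(bisect_right(divisions, idx) - 1, n_parts - 1)
        parts[p].append((idx, value))
    return parts


def partition_sum(part):
    """Reduce one partition on its own, independent of the others."""
    return sum(value for _, value in part)


partitions = split_by_index(rows, divisions)

# Each partial sum could be computed on a different core or machine.
partial_sums = [partition_sum(part) for part in partitions]

# Combining the partial results gives the answer for the whole table.
total = sum(partial_sums)
```

Because each partition covers a known index range, a lookup by index only needs to touch the one partition whose range contains that value, rather than scanning the whole table.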
RPM found in directory: /packages/linux-pbone/ftp5.gwdg.de/pub/opensuse/repositories/openSUSE:/Factory:/RISCV:/ToTest/images/repo/openSUSE-20240817-riscv64-Media1/noarch