Name : python311-parquet-py
Version : 0.2.1b0
Vendor : obs://build_opensuse_org/home:zachspar
Release : 20.11
Date : 2024-10-11 00:46:35
Group : Unspecified
Source RPM : python-parquet-py-0.2.1b0-20.11.src.rpm
Size : 3.84 MB
Packager : (none)
Summary : None
Description :
[![PyPI version](https://badge.fury.io/py/parquet-py.svg)](https://badge.fury.io/py/parquet-py)
Parquet-Py is a simple command-line interface and Python API for working with Parquet files. It converts Parquet files into CSV, JSON, lists, and iterators for easy manipulation and access in Python applications.
Using Rust bindings under the hood, Parquet-Py provides a fast and efficient way to work with Parquet files, making it ideal for converting or processing large datasets.
- **Convert Parquet to CSV**: Convert your Parquet files into CSV format for easy viewing and processing in spreadsheet applications.
- **Convert Parquet to JSON / JSON Lines**: Easily convert your Parquet files into a JSON Array or JSON Lines format for quick inspection or processing.
- **Iterable Parquet Rows**: Access Parquet file rows through an iterator, allowing for efficient row-by-row processing without loading the entire file into memory.
- **Convert Parquet to Python List**: Transform your Parquet files into Python lists, where each row is represented as a dictionary within the list.
`pip install parquet-py`
> [!WARNING]
>
> The CLI is still under development and may not be fully functional.
>
> Breaking changes may occur in future releases.

> [!TIP]
>
> Multiple input files can be specified with the `--input` option. For example, `--input file1.parquet --input file2.parquet`.
To convert a Parquet file into a CSV file, use the `parq convert` command.
```bash
parq convert --input path/to/your/file.parquet --format csv --output example.csv
```
To convert a Parquet file into a JSON Array, use the `parq convert` command.
```bash
parq convert --input path/to/your/file.parquet --format json --output example.json
```
To convert a Parquet file into JSON Lines, use the `parq convert` command.
```bash
parq convert --input path/to/your/file.parquet --format jsonl --output example.jsonl
```
To iterate over the rows of a Parquet file, use the `to_iter` function. This allows for efficient row-by-row processing without loading the entire file into memory.
```python
from parq import to_iter

file_path = "path/to/your/file.parquet"

for row in to_iter(file_path):
    print(row)
```
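Because rows arrive one at a time, an aggregate can be built without holding the whole file in memory. The sketch below is illustrative only: it assumes each yielded row behaves like a dictionary (the description states this only for `to_list`), and `fake_rows` and `count_by_column` are hypothetical names, with `fake_rows` standing in for a call to `to_iter`.

```python
from collections import Counter
from typing import Any, Iterable, Mapping


def count_by_column(rows: Iterable[Mapping[str, Any]], column: str) -> Counter:
    """Tally the values of one column while consuming rows one at a time."""
    counts: Counter = Counter()
    for row in rows:
        # .get() returns None for rows that lack the column.
        counts[row.get(column)] += 1
    return counts


def fake_rows():
    # Hypothetical stand-in for to_iter("path/to/your/file.parquet").
    # Assumption: rows are dictionary-like.
    yield {"city": "Oslo", "temp": 4}
    yield {"city": "Lima", "temp": 19}
    yield {"city": "Oslo", "temp": 6}


counts = count_by_column(fake_rows(), "city")
print(counts.most_common())
```

With the real library, `fake_rows()` would be swapped for `to_iter(file_path)`, provided the row-shape assumption holds.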
To convert a Parquet file into a CSV string, use the `to_csv_str` function.
```python
from parq import to_csv_str

file_path = "path/to/your/file.parquet"

csv_str = to_csv_str(file_path)
print(csv_str)
```
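A CSV string like this can be parsed back into records with the standard library. This is a minimal sketch, assuming the string starts with a header row (the description does not say whether it does); the sample `csv_str` is made-up data standing in for the return value of `to_csv_str`.

```python
import csv
import io

# Hypothetical stand-in for to_csv_str(file_path).
# Assumption: the first line is a header row.
csv_str = "id,name\n1,alpha\n2,beta\n"

# DictReader maps each data line to a dict keyed by the header names.
reader = csv.DictReader(io.StringIO(csv_str))
records = list(reader)

for record in records:
    print(record["id"], record["name"])
```

Note that `csv.DictReader` returns every value as a string, so numeric columns need explicit conversion.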
To convert a Parquet file into a JSON string, use the `to_json_str` function.
```python
from parq import to_json_str

file_path = "path/to/your/file.parquet"

json_str = to_json_str(file_path)
print(json_str)
```
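The JSON string can be decoded with the standard `json` module. The sketch below assumes the string is a JSON Array of row objects, mirroring the CLI's `json` format; the sample `json_str` is made-up data standing in for the return value of `to_json_str`.

```python
import json

# Hypothetical stand-in for to_json_str(file_path).
# Assumption: the string is a JSON Array with one object per row.
json_str = '[{"id": 1, "name": "alpha"}, {"id": 2, "name": "beta"}]'

rows = json.loads(json_str)

# Pull a single field out of every decoded row.
names = [row["name"] for row in rows]
print(names)
```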
To convert a Parquet file into a Python list, where each row is represented as a dictionary within the list, use the `to_list` function.
```python
from parq import to_list

file_path = "path/to/your/file.parquet"

data_list = to_list(file_path)
print(len(data_list))
```
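Since each row is a dictionary, the list can be pivoted into per-column lists for quick inspection. This is an illustrative sketch: `rows_to_columns` is a hypothetical helper, not part of the package, and the sample `data_list` stands in for the return value of `to_list`.

```python
from collections import defaultdict
from typing import Any, Dict, List


def rows_to_columns(rows: List[Dict[str, Any]]) -> Dict[str, List[Any]]:
    """Pivot a list of row dictionaries into a dictionary of column lists."""
    columns: Dict[str, List[Any]] = defaultdict(list)
    for row in rows:
        for key, value in row.items():
            columns[key].append(value)
    return dict(columns)


# Hypothetical stand-in for to_list(file_path).
data_list = [{"id": 1, "name": "alpha"}, {"id": 2, "name": "beta"}]

columns = rows_to_columns(data_list)
print(columns["name"])
```

The helper assumes every row has the same keys; rows with missing keys would produce columns of unequal length.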
RPM found in directory: /packages/linux-pbone/ftp5.gwdg.de/pub/opensuse/repositories/home:/zachspar/openSUSE_Tumbleweed/x86_64
Provides :
libparq.so()(64bit)
python3-parquet-py
python3.11dist(parquet-py)
python311-parquet-py
python311-parquet-py(x86-64)
python3dist(parquet-py)
Requires :