A simple toolkit for converting data sources generated by img2dataset from Parquet files into Hugging Face datasets.
Updated Mar 21, 2023 - Python
Example usage of Python with Parquet for IoT data storage and analytics
Dresser-le-procureur.com
Annual Revenue vs. Executive Pay for Recipients of U.S. Federal Funds; uses Scala Spark in a Zeppelin notebook.
Scrapes the data from rpi-imager-stats daily
Apache Spark is a fast, in-memory data processing engine with elegant and expressive development APIs that allow data workers to efficiently execute streaming, machine learning, or SQL workloads that require fast iterative access to datasets. This project contains sample programs for Spark in Scala.
Nushell command that saves data to a Parquet file using a CSV mapping
Hackolade plugin for Apache Parquet schema
A tiny sample showing how to build a Parquet file from whatever data you need.