Parquet VisualizerInspect and query very large parquet files fastWhat's Parquet VisualizerParquet Visualizer is a tool that helps you easily query with SQL and inspect very large parquet files fast and easy. What can you do with Parquet Visualizer?Inspect DataYou can inspect the binary data of parquet files in a human readable tabular format with pagination. You can also change the page size. Inspect Struct ValueYou can easily inspect complex struct values by clicking on the cell, which shows a popup containing the value of the struct. Run SQL Queries on a Parquet FileYou can query a parquet file with DuckDB SQL. You can also search within the result, paginate the result or change the page size. Advanced Autocomplete in Query EditorBy typing in the editor, an autocomplete box with column suggestions appear. This makes it much easier to write queries, by selecting the suggested columns. Search within rows of Query ResultBy typing free text in the search box, find specific values of the query result page. Export Query Result as CSV, Excel, JSON, ndJSON or Parquet to DiskBy clicking on the export button in the query tab, you can save your query result to a specified location in CSV, Excel, JSON, ndJSON or Parquet format to disk. Copy Query Result to ClipboardBy clicking on the copy button in the Query tab, you can copy the query result page data to the clipboard. Inspect SchemaYou can inspect the schema of the parquet file by clicking on the Schema tab, in which you can paginate if the file has many columns. You can also inspect the struct type by clicking on the cell, which will show a popup containing the struct data type. Inspect MetadataYou can inspect the metadata in tabular format by clicking on the Metadata tab. Color ThemeThe theme of the extension (dark or light) is based on your VS Code Color theme setting. If the color theme is light, the extension will load it's light theme. When you change your theme settings, all active documents will change theme automatically. ConfigurationThe following configuration options are available:
Parquet backendsThis extension supports two different types of backends for visualizing and querying parquet files. DuckDBDuckDB is the primary backend used for uncompressed and compressed parquet files (except for the BROTLI compression codec.) Parquet-wasmparquet-wasm is a backend that uses a Rust implementation of arrow and parquet. It supports all compression codecs except LZ4. FrontendThe tables of the frontend are powered by tabulator. The query editor of the frontend is powered by ace. Release NotesSee the CHANGELOG.MD ContributorsThe following people have contributed time and effort to improve Parquet Visualizer:
TelemetryTo improve the quality of Parquet Visualizer, the extension collects the following analytics such as:
Our telemetry implementation respects the vscode You can disable it via the settings by following the instructions here. You can view all the possible telemetry events that are sent by following instructions here. |