The JSON File Manager is a Python GUI application for processing and managing JSON files. It provides functionality to clean JSON data and find exact and similar duplicate JSON files based on the 'position' field.
This is still very much a work in progress, and because it can delete files, please proceed with caution.
- Clean JSON data:
  - Clear the 'name' field
  - Write the filename to the 'name' field
  - Remove the 'description' field
- Find exact duplicate JSON files based on the 'position' field
- Find similar duplicate JSON files within a specified tolerance
- Round position values to a specified number of decimal places
- Sort and delete duplicate/near-duplicate files
- Save results to a text file
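
The cleaning and duplicate-finding features above could be sketched roughly as follows. This is a minimal illustration, not the application's actual code. The function names (`clean_record`, `group_exact`, `find_similar`) are hypothetical. The sketch assumes that 'position' is a list of numbers, that "within tolerance" means every component differs by at most the tolerance, and that the filename is written to 'name' without its extension.

```python
from collections import defaultdict
from pathlib import Path

import numpy as np


def clean_record(data, filename, clear_name=False, name_from_file=False,
                 drop_description=False):
    """Apply the selected cleaning options to one parsed JSON object (a dict)."""
    cleaned = dict(data)  # shallow copy so the caller's dict is not modified
    if clear_name:
        cleaned["name"] = ""
    if name_from_file:
        # Assumption: the name is the filename without its extension.
        cleaned["name"] = Path(filename).stem
    if drop_description:
        cleaned.pop("description", None)
    return cleaned


def group_exact(positions, decimals=None):
    """Group filenames whose positions are identical, optionally after rounding.

    `positions` maps filename -> position (assumed to be a list of numbers).
    Returns only groups that contain more than one file.
    """
    groups = defaultdict(list)
    for name, pos in positions.items():
        arr = np.asarray(pos, dtype=float)
        if decimals is not None:
            arr = np.round(arr, decimals)
        groups[tuple(arr.tolist())].append(name)
    return [sorted(names) for names in groups.values() if len(names) > 1]


def find_similar(positions, tolerance):
    """Return filename pairs whose positions differ by at most `tolerance`
    in every component (an assumed definition of "similar")."""
    names = sorted(positions)
    pairs = []
    for i, a in enumerate(names):
        pa = np.asarray(positions[a], dtype=float)
        for b in names[i + 1:]:
            pb = np.asarray(positions[b], dtype=float)
            if pa.shape == pb.shape and np.all(np.abs(pa - pb) <= tolerance):
                pairs.append((a, b))
    return pairs
```

The pairwise comparison in `find_similar` is quadratic in the number of files, so a sketch like this would slow down noticeably on large directories.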
- Clone the repository or download the script files.
- Install the required dependencies (tkinter, NumPy).
- Run the `gui.py` script to launch the JSON File Manager application.
- Select the directory containing the JSON files you want to process.
- Choose the desired cleaning and duplicate finding options.
- Click the "Begin!" button to start processing the files.
- Delete or move duplicate/near-duplicate files as needed.
- Save the results to a text file if desired.
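
For readers who want to inspect a directory outside the GUI, the scanning step might look something like the sketch below. The helper name `load_positions` is hypothetical, and the sketch assumes each file holds a single JSON object with a top-level 'position' key. Files that cannot be read or parsed are skipped.

```python
import json
from pathlib import Path


def load_positions(directory):
    """Map each *.json file name in `directory` to its 'position' value."""
    positions = {}
    for path in sorted(Path(directory).glob("*.json")):
        try:
            data = json.loads(path.read_text(encoding="utf-8"))
        except (OSError, ValueError):
            # Unreadable, badly encoded, or malformed JSON: skip the file.
            continue
        # Assumption: each file is a JSON object with a top-level 'position'.
        if isinstance(data, dict) and "position" in data:
            positions[path.name] = data["position"]
    return positions
```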
- Python 3.x
- tkinter
- NumPy
- If too many files are selected for moving/deletion, the box gets cut off at the bottom
- While scanning, it displays "0 matches found" despite finding matches
- The sorting for the list of duplicate files seems to be bugged
- Overhaul GUI
- Implement loading results list
- Display the position itself for each group
  - Currently, position data is saved if "Save Results" is selected
- Option to always ensure one of each matching file is preserved
- Optimize scanning (starts to slow down after 10K+ files)
- More customization (e.g., instead of 'Clear Name Field', allow the user to specify 'Clear x Field')
- Pause/Stop/Resume functionality when scanning
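
One possible approach to the planned "always preserve one of each matching file" option is sketched below. The helper `files_to_remove` is hypothetical and not part of the current code. It assumes duplicate groups are available as lists of filenames and keeps the alphabetically first file in each group, which is an arbitrary choice.

```python
def files_to_remove(groups):
    """For each duplicate group, keep one file and return the rest.

    `groups` is an iterable of lists of filenames. The file kept is the
    first one in sorted order (an arbitrary, assumed policy).
    """
    removable = []
    for group in groups:
        if not group:
            continue  # nothing to keep or remove in an empty group
        _keep, *rest = sorted(group)
        removable.extend(rest)
    return removable
```

Because only the returned names would be passed on for deletion or moving, every group keeps at least one file.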