Command Line Output:
⏳ Please wait... syncing in progress...
This application provides a platform to view and summarise the parquet catalog. It is designed to be used continuously as data arrives to monitor quality and summaries.
Data is uploaded to the folder using the incremental backup function rsync.
Any new raw data (ideally situated on a server) is automatically uploaded to the Parquet format when this application is opened.
Parquet catalogs are flexible and allow missing or misformatted data, which can be useful. However, this permissiveness also poses risks—errors often go unnoticed, leading to missing or incorrect data. Careful coding and validation are therefore essential to ensure data integrity.
A lot of the coding effort is devoted to dealing with very messy formatting. The simplest solution is to use rsync to create an incremental data repository of local data on the server. Every time this application is opened, new files or changes are automatically uploaded to the parquet database (process_device_v12.R). Note that if a file is changed and processed it will appear multiple times in the log file. For errors, only the latest error is rcorded in the log file.
The process_watch_v3.R script processes files by: