-
Notifications
You must be signed in to change notification settings - Fork 26
What's the 'no_compression' attribute means? #11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
DeepDB by default applies compression techniques in the leaf nodes to reduce the storage overhead. Usually, this does not hurt the accuracy for numerical columns. However, if there are numerical columns with only a few distinct values or if you want to apply equality predicateson numerical columns, this compression can result in a performance regression. Hence, in these cases you should include this column into the no_compression columns. |
Thanks for your reply. python3 maqp.py --evaluate_confidence_intervals Best, |
This code evaluates the quality of the confidence intervals predicted by DeepDB. For an evaluation, you of course need the ground truth to compare the predicted intervals. In this case, the ground truth are simply the computed confidence intervals given a sample using standard statistical methods. The content of the files can also be found in this repository. We have also included the code that generates these files (cf. Command below "Optional: Create the ground truth for confidence interval. " in the repository). |
Hi,
I want to inject new datasets and what's the no_compression attribute means? Why we need to set all attributes to no_compression by default?
Best,
Kangfei
The text was updated successfully, but these errors were encountered: