
Commit 8bde5dd

a new start
1 parent d1528ea commit 8bde5dd

File tree

189 files changed: +1185520 −1 lines changed


.flake8

Lines changed: 4 additions & 0 deletions
@@ -0,0 +1,4 @@
+[flake8]
+# E701: multiple statements on one line (colon)
+ignore = E701
+max-line-length = 160

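As a usage note: flake8 picks up this `[flake8]` section automatically when run from the repo root (e.g. `flake8 benchmarks` or `python3 -m flake8 benchmarks`), so one-line `if cond: stmt` statements (E701) pass and lines may run up to 160 characters.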
.gitignore

Lines changed: 3 additions & 1 deletion
@@ -2,4 +2,6 @@
 *catboost_info*
 *logs.log*
 */debug.out
-.DS_Store
+repos
+*.pyc
+benchmarks/datasets/job_salary/data.csv

README.md

Lines changed: 36 additions & 0 deletions
@@ -0,0 +1,36 @@
+# Mindsdb Benchmark Suite
+
+Note: This suite is now available to the public, but it is still meant to be run internally. We will soon provide local setup instructions, as well as the database mirror needed to compare your results against our ongoing benchmarks.
+
+## Important
+
+A benchmark is identified by: a dataset name, an accuracy function, a lightwood version, and a lightwood commit.
+"Running the benchmark" means running all potential dataset and accuracy function combinations for a specific lightwood version and commit.
+All benchmark runs are logged in the database, *and* the database is the only way to look at the results (the plotting server makes this easy).
+
+## Install
+
+As usual: clone, add to the python path, install the requirements, and make sure lightwood is installed. If this doesn't make sense, you shouldn't be running the benchmark suite; instead, ask a colleague for a mindsdb dev environment setup tutorial.
+
+## Useful scenarios
+
+### Benchmarking a local experiment
+
+I have a local version / commit-hash / both of lightwood and I want to benchmark its accuracy against any other version of lightwood.
+
+Run the benchmarks as: `python3 benchmarks/run.py --lightwood=#env --use_ray=0` (this assumes you have a single GPU; if you have multiple GPUs or a beastly single GPU, use `--use_ray=1`)
+Compare as: `http://0.0.0.0:9107/compare/<base lightwood version>/<your version and/or commit hash>`
+
+## Improvement ideas
+
+### Plotting
+
+Two types of improvement: relative to the previous score *and* relative to the total (i.e. `first - second` for an accuracy score going from 0 to 1). We'll call these "relative" (improvement relative to the previous score) and "absolute" (the raw difference on a 0-to-1 scale, for accuracy functions that are capped).
+
+For both relative and absolute improvement we'll have:
+
+* Mean
+* Median
+* Box plot
+
+Also maybe tag datasets as: Classification, Regression, Text, Timeseries

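The README's "Running the benchmark" definition (all dataset and accuracy function combinations for one lightwood version and commit) can be sketched in a few lines of Python. This is illustrative only: `DATASETS`, `ACCURACY_FUNCTIONS`, `run_one`, and `run_all` are hypothetical names, not the suite's actual API (`job_salary` is borrowed from the `.gitignore` above; `home_rentals` is a made-up placeholder).

```python
# Hypothetical sketch of "running the benchmark": every (dataset,
# accuracy function) pair is evaluated for one lightwood version/commit.
from itertools import product

DATASETS = ['job_salary', 'home_rentals']                     # placeholder names
ACCURACY_FUNCTIONS = ['r2_score', 'balanced_accuracy_score']  # placeholder names

def run_one(dataset: str, acc_fn: str, version: str, commit: str) -> dict:
    # A real runner would train and evaluate here; this just returns the
    # 4-tuple that uniquely identifies a benchmark, plus a dummy score.
    return {'dataset': dataset, 'accuracy_function': acc_fn,
            'lightwood_version': version, 'lightwood_commit': commit,
            'score': None}

def run_all(version: str, commit: str) -> list:
    return [run_one(d, a, version, commit)
            for d, a in product(DATASETS, ACCURACY_FUNCTIONS)]
```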
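Similarly, the "relative" vs "absolute" improvement distinction under Plotting reduces to two small formulas. A minimal sketch, assuming scores are floats and capped accuracy functions live on a 0-to-1 scale; the score pairs are made up purely for illustration:

```python
# Relative improvement: change relative to the previous score.
# Absolute improvement: raw difference on the 0-to-1 scale (`first - second`).
from statistics import mean, median

def relative_improvement(old: float, new: float) -> float:
    return (new - old) / old

def absolute_improvement(old: float, new: float) -> float:
    return new - old

pairs = [(0.70, 0.77), (0.50, 0.55), (0.90, 0.91)]  # made-up (old, new) scores
rel = [relative_improvement(o, n) for o, n in pairs]
abs_ = [absolute_improvement(o, n) for o, n in pairs]
print(mean(rel), median(rel))    # ~0.0704 0.1  -> inputs for the mean/median/box plots
print(mean(abs_), median(abs_))  # ~0.0433 0.05
```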
benchmarks/__init__.py

Lines changed: 2 additions & 0 deletions
@@ -0,0 +1,2 @@
+import uuid
+BATCH_ID = uuid.uuid4().hex

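Since `BATCH_ID` is assigned at module import time, all code in one process sees a single 32-character hex id, so every run launched by one invocation of the suite presumably shares it (e.g. as a tag in the results database). A quick illustration of the underlying `uuid` behavior:

```python
# uuid.uuid4().hex is evaluated once when the module is first imported,
# giving one shared id per process; each fresh call yields a new id.
import uuid

batch_id = uuid.uuid4().hex
print(len(batch_id))                  # 32 hex characters
print(batch_id == uuid.uuid4().hex)   # False: a new call, a new id
```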
benchmarks/__main__.py

Lines changed: 4 additions & 0 deletions
@@ -0,0 +1,4 @@
+from benchmarks.run import main
+
+if __name__ == '__main__':
+    main()

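A side effect of adding `benchmarks/__main__.py`: the suite can also be launched as `python3 -m benchmarks` (with the same flags), since Python executes a package's `__main__.py` when the package is run with `-m`; this presumably mirrors the `python3 benchmarks/run.py` entry point shown in the README.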
benchmarks/datasets/__init__.py

Whitespace-only changes.
