@@ -23,38 +23,38 @@ in the spirit of TimescaleDB's Time Series Benchmark Suite (TSBS).
## About
- The `tsperf` program is a database workload generator including two different domains,
- one for writing data and another one for reading.
+ The `tsperf` program includes both a database workload generator and a query
+ timer. That effectively spans two domains: one for writing data, and another
+ one for reading.
- - [Data generator]: Generate time series data and feed it into database.
+ - [Data Generator]: Generate time series data and feed it into the database.
  Use `tsperf write --help` to explore its options.
- - [Query timer]: Probe responsiveness of database on the read path.
+ - [Query Timer]: Probe the responsiveness of the database on the read path.
  Use `tsperf read --help` to explore its options.
For the purpose of capacity testing, both domains try to simulate the generation and querying of
time-series data. As the program is easy to use, it provides instant reward without the need to
set up a whole data ingestion chain.
- [Data generator]: tsperf/write/README.md
- [Query timer]: tsperf/read/README.md
-
## Features
+ ### General
* Generate random data which follows a statistical model to better reflect real-world scenarios;
  real-world data is almost never truly random.
* The "steady load" mode can simulate a constant load of a defined number of messages per second.
* Ready-made to deploy and scale data generators with Docker containers. In order to maximize
-   performance, multiple instances of the Data Generator can be run in parallel.
-   This can be achieved by [using Kubernetes](KUBERNETES.md).
+   performance, multiple instances of the data generator can be run in parallel.
+   This can be achieved by [parallelizing using Kubernetes].
* Metrics are exposed for consumption by Prometheus.
- * Data generator features
-   * Easy to define your own [schema](tsperf/write/README.md#data-generator-schemas).
-   * Full control on how many values will be inserted.
-   * Scale out to multiple clients is a core concept.
-   * Huge sets of data can be inserted without creating files as intermediate storage.
- ### Supported databases
+ ### Data Generator
+ * Capability to [define your own schema].
+ * Full control over how many values will be inserted.
+ * Scaling out to multiple clients is a core concept.
+ * Huge sets of data can be inserted without creating files as intermediate storage.
+
+ ### Database Coverage
* CrateDB
* InfluxDB
* Microsoft SQL Server
@@ -64,20 +64,6 @@ set up a whole data ingestion chain.
* Timestream
- ## Prior art
-
- ### TSBS
- The [Time Series Benchmark Suite (TSBS)] is a collection of Go programs that are used to generate
- datasets and then benchmark read and write performance of various databases.
-
- ### cr8 + mkjson
- `mkjson` combined with `cr8 insert-json` makes it easy to generate random entries into a table.
- See [generate data sets using mkjson] for an example how to use `cr8` together with `mkjson`.
-
- [generate data sets using mkjson]: https://zignar.net/2020/05/01/generating-data-sets-using-mkjson/
- [Time Series Benchmark Suite (TSBS)]: https://github.com/timescale/tsbs
-
-
## Install
### Python package
@@ -87,190 +73,54 @@ pip install --user tsperf
### Docker image
```shell
- docker run -it --rm --network=host tsperf tsperf write --help
+ alias tsperf="docker run -it --rm --network=host tsperf tsperf"
+ tsperf --help
```
-
## Usage
- This section outlines the usage of `tsperf` on different databases. Please note that using Docker
- here is just for demonstration purposes. In reality, you will want to run the database workload
- against a database instance running on a decently powered machine.
-
- - For increasing concurrency, try `--concurrency=8`.
- - For enabling Prometheus metrics export, try `--prometheus-enable=true` and maybe `--prometheus-listen=0.0.0.0:8000`.
- - For increasing concurrency and number of iterations when querying, try `--concurrency=10 --iterations=2000`.
- - For displaying the list of built-in schemas, run `tsperf schema --list`.
-
-
- ### CrateDB
- ```shell
- # Run CrateDB
- docker run -it --rm --publish=4200:4200 --publish=5432:5432 crate:4.5.1
-
- # Feed data into CrateDB table.
- # Adjust write parameters like `--partition=day --shards=6 --replicas=3`.
- tsperf write --adapter=cratedb --schema=tsperf.schema.basic:environment.json
- tsperf write --schema=tsperf.schema.basic:environment.json --adapter=cratedb --address=cratedb.example.org:4200
-
- # Use Docker.
- docker run -it --rm --network=host tsperf tsperf write --schema=tsperf.schema.basic:environment.json --adapter=cratedb
-
- # Query data from CrateDB table.
- tsperf read --adapter=cratedb --query="SELECT * FROM environment LIMIT 10;"
- ```
-
- ### CrateDB+PostgreSQL
- ```shell
- # Run CrateDB workload via PostgreSQL protocol.
- tsperf write --adapter=cratedbpg --schema=tsperf.schema.basic:environment.json
- tsperf read --adapter=cratedbpg --iterations=3000 --query="SELECT * FROM environment LIMIT 10;"
-
- # Run PostgreSQL workload on CrateDB.
- tsperf write --adapter=postgresql --schema=tsperf.schema.basic:environment.json
- tsperf read --adapter=postgresql --username=crate --iterations=3000 --query="SELECT * FROM environment LIMIT 10;"
- ```
-
-
- ### InfluxDB
- ```shell
- # Run and configure InfluxDB
- docker run -it --rm --publish=8086:8086 influxdb:2.0
- influx setup --name=default --username=root --password=12345678 --org=acme --bucket=environment --retention=0 --force
- cat /Users/amo/.influxdbv2/configs
-
- # Configure tsperf
- export ADAPTER=influxdb
- export ADDRESS=http://localhost:8086/
- export INFLUXDB_ORGANIZATION=acme
- export INFLUXDB_TOKEN="X1kHPaXvS...p1IAQ=="
-
- # Feed data into InfluxDB bucket.
- tsperf write --schema=tsperf.schema.basic:environment.json
-
- # Query data from InfluxDB bucket.
- tsperf read --query='from(bucket:"environment") |> range(start:-2h, stop:2h) |> limit(n: 10)'
- ```
-
- On InfluxDB Cloud, after generating an "All Access Token", configure `tsperf` like:
- ```shell
- export ADAPTER=influxdb
- export ADDRESS="https://eu-central-1-1.aws.cloud2.influxdata.com/"
- export INFLUXDB_ORGANIZATION=a05test6edtest2d
- export INFLUXDB_TOKEN="wpNtestfeNUveYitDLk8Ld47vrSVUTKB_vEaEwWC7qXj_ZqvOwYCRhQTB4EDty3uLFMXWP2C195gtestt4XGFQ=="
- ```
-
-
- ### Microsoft SQL Server
- ```shell
- # Run Microsoft SQL Server
- docker run -it --rm --publish=1433:1433 --env="ACCEPT_EULA=Y" --env="SA_PASSWORD=yayRirr3" mcr.microsoft.com/mssql/server:2019-latest
- docker exec -it aeba7fdd4d73 /opt/mssql-tools/bin/sqlcmd -S localhost -U sa -P yayRirr3 -Q "select @@Version"
-
- # Install the Microsoft ODBC driver for SQL Server
- # - Visit: https://docs.microsoft.com/en-us/sql/connect/odbc/download-odbc-driver-for-sql-server
-
- # Configure tsperf
- export ADAPTER=mssql
-
- # Feed data into MSSQL table.
- tsperf write --schema=tsperf.schema.basic:environment.json
-
- # Query data from MSSQL table.
- tsperf read --iterations=3000 --query="SELECT TOP 10 * FROM environment;"
- ```
-
-
- ### MongoDB
- ```shell
- # Run and configure MongoDB
- docker run -it --rm --publish=27017:27017 mongo:4.4
-
- # Feed data into MongoDB collection.
- tsperf write --adapter=mongodb --schema=tsperf.schema.basic:environment.json
-
- # Query data from MongoDB collection.
- tsperf read --adapter=mongodb --schema=tsperf.schema.basic:environment.json
-
- # For connecting to MongoDB Atlas, use:
- export ADDRESS="mongodb+srv://username:[email protected]/tsperf?retryWrites=true&w=majority"
- ```
-
-
- ### PostgreSQL
- ```shell
- # Run PostgreSQL
- docker run -it --rm --env="POSTGRES_HOST_AUTH_METHOD=trust" --publish=5432:5432 postgres:13.3
-
- # Configure tsperf
- export ADAPTER=postgresql
-
- # Feed data into PostgreSQL table.
- tsperf write --schema=tsperf.schema.basic:environment.json
-
- # Query data from PostgreSQL table.
- tsperf read --iterations=3000 --query="SELECT * FROM environment LIMIT 10;"
- ```
-
-
- ### TimescaleDB
- ```shell
- # Run TimescaleDB
- docker run -it --rm --env="POSTGRES_HOST_AUTH_METHOD=trust" --publish=5432:5432 timescale/timescaledb:2.3.0-pg13
-
- # Configure tsperf
- export ADAPTER=timescaledb
-
- # Feed data into TimescaleDB hypertable.
- # Adjust write parameters like `--timescaledb-distributed --timescaledb-pgcopy`.
- tsperf write --schema=tsperf.schema.basic:environment.json
-
- # Query data from TimescaleDB hypertable.
- tsperf read --iterations=3000 --query="SELECT * FROM environment LIMIT 10;"
- ```
-
-
- ### Timestream
- ```shell
- # Run Timestream
-
- # There is no way to run Amazon Timestream on premises.
- # - https://aws.amazon.com/timestream/
- # - https://docs.aws.amazon.com/timestream/
-
- # Configure tsperf
- export ADAPTER=timestream
- export ADDRESS=ingest-cell1.timestream.us-west-2.amazonaws.com
- export AWS_ACCESS_KEY_ID=AKIAIOSFODNN7EXAMPLE
- export AWS_SECRET_ACCESS_KEY=wJalrXUtnFEMI/K7MDENG/bPxRfiCYEXAMPLEKEY
- export AWS_REGION_NAME=us-west-2
-
- # Feed data into Timestream table.
- tsperf write --schema=tsperf.schema.basic:environment.json
-
- # Query data from Timestream table.
- tsperf read --iterations=3000 --query="SELECT * FROM environment LIMIT 10;"
- ```
+ Please refer to the [usage] documentation.
+
+
+ ## Prior Art
+
+ ### cr8 + mkjson
+ `mkjson` combined with `cr8 insert-json` makes it easy to generate random entries into a table.
+ See [generate data sets using mkjson] for an example of how to use `cr8` together with `mkjson`.
+
+ ### TSBS
+ The [Time Series Benchmark Suite (TSBS)] is a collection of Go programs that are used to generate
+ datasets and then benchmark read and write performance of various databases.
- ## Contributing
- We are always happy to receive code contributions, ideas, suggestions and problem reports from the community.
-
- So, if you’d like to contribute you’re most welcome. Spend some time taking a look around, locate a bug, design
- issue or spelling mistake and then send us a pull request or open an issue on GitHub.
+
+ ## Project Information
+
+ ### Contributing
+ We are always happy to receive code contributions, ideas, suggestions, and
+ problem reports from the community.
+
+ So, if you’d like to contribute, you’re most welcome. Spend some time taking
+ a look around, locate a bug, design issue, or spelling mistake, and then send
+ us a pull request or open an issue on GitHub.

Thanks in advance for your efforts, we really appreciate any help or feedback.
-
- ## Acknowledgements
+ ### Acknowledgements
Thanks to all the contributors who helped to co-create and conceive `tsperf`
in one way or another and kudos to all authors of the foundational libraries.
-
- ## License
+ ### License
This project is licensed under the terms of the Apache 2.0 license.
+ [Data Generator]: https://tsperf.readthedocs.io/data-generator.html
+ [define your own schema]: https://tsperf.readthedocs.io/data-generator.html#data-generator-schemas
+ [generate data sets using mkjson]: https://zignar.net/2020/05/01/generating-data-sets-using-mkjson/
+ [parallelizing using Kubernetes]: https://tsperf.readthedocs.io/performance.html
+ [Query Timer]: https://tsperf.readthedocs.io/query-timer.html
+ [Time Series Benchmark Suite (TSBS)]: https://github.com/timescale/tsbs
+ [Usage]: https://tsperf.readthedocs.io/usage.html
+
[Changelog]: https://github.com/crate/tsperf/blob/main/CHANGES.md
[Documentation]: https://tsperf.readthedocs.io/
[Issues]: https://github.com/crate/tsperf/issues