Move Dataset API from telemetry-batch-view to its own package on maven #1

vitillo · 2017-01-27T18:57:51Z

See Bug 1283446. Since I was at it I completely rewrote the test suite using fakes3. I am planning to add CI integration before this gets merged.

mreid-moz · 2017-01-27T19:50:03Z

Does the Heka-reading code work with the gzipped format a-la this pr and bug 1302264?

whd · 2017-01-27T20:04:55Z

src/test/scala/com/mozilla/telemetry/utils/S3StoreTest.scala

+  }
+
+  it can "read gzipped files" in {
+    /* Not supported yet https://github.com/jubos/fake-s3/pull/52


The referenced PR was closed recently, so perhaps gzip is supported? I am guessing fake-s3 is your answer to the more general testing issues discussed in mozilla/telemetry-batch-view#126.

I rewrote the testing infrastructure to address the more general testing issues with telemetry-batch-view.

While the Dataset API supports gzipped files (it's the same code we are using in telemetry-batch-view) fake-s3 doesn't just yet. In other words we can't write the test for it but we will be able to do so very soon.

codecov-io · 2017-02-02T14:32:31Z

Codecov Report

❗ No coverage uploaded for pull request base (master@22473f2). Click here to learn what that means.

@@            Coverage Diff            @@
##             master       #1   +/-   ##
=========================================
  Coverage          ?   99.15%           
=========================================
  Files             ?        5           
  Lines             ?      119           
  Branches          ?       21           
=========================================
  Hits              ?      118           
  Misses            ?        1           
  Partials          ?        0

Impacted Files	Coverage Δ
...ain/scala/com/mozilla/telemetry/heka/Dataset.scala	`100% <100%> (ø)`
...c/main/scala/com/mozilla/telemetry/heka/File.scala	`100% <100%> (ø)`
...rc/main/scala/com/mozilla/telemetry/utils/S3.scala	`100% <100%> (ø)`
...la/com/mozilla/telemetry/utils/ObjectSummary.scala	`100% <100%> (ø)`
...ain/scala/com/mozilla/telemetry/heka/package.scala	`90.9% <90.9%> (ø)`

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 22473f2...e5ad631. Read the comment docs.

maurodoglio

The code looks good, but there are a couple of things that worry me:
1- from a licence standpoint it may be easier to use the moto standalone server rather than fakeS3
2- iiuc users of the library need to run the fakeS3 server by hand before they can run the tests. This should be at least put in a README file and eventually automated as part of the test suite setup. The latter can probably wait though

mreid-moz · 2017-02-06T15:49:53Z

Can we add a link somewhere - either in the Heka-related code or in the README - to where the src/main/protobuf/heka.proto comes from?

mreid-moz

Looks good, added a few nits, +1 on Mauro's comment about documenting the fake S3 server for tests.

mreid-moz · 2017-02-06T15:53:13Z

src/main/scala/com/mozilla/telemetry/heka/Dataset.scala

+    }
+
+    if (!schema.dimensions.contains(Dimension(dimension))) {
+      throw new Exception(s"The dimension $dimension doesn't exists")


s/exists/exist/

mreid-moz · 2017-02-06T15:58:16Z

src/main/scala/com/mozilla/telemetry/heka/File.scala

+import java.io.InputStream
+import org.xerial.snappy.Snappy
+
+object File{


nit: Add a space before the { (and can we add a style check for that?)

mreid-moz · 2017-02-06T16:05:11Z

src/test/scala/com/mozilla/telemetry/heka/DatasetTest.scala

+import org.apache.spark.{SparkConf, SparkContext}
+import org.scalatest.{BeforeAndAfterAll, FlatSpec, Matchers}
+
+class DatasetTest extends FlatSpec with Matchers with BeforeAndAfterAll{


nit: please add a space before {

mreid-moz · 2017-02-06T16:08:05Z

scalastyle-config.xml

+ <check level="warning" class="org.scalastyle.file.FileLineLengthChecker" enabled="true">
+  <parameters>
+   <parameter name="maxLineLength"><![CDATA[160]]></parameter>
+   <parameter name="tabSize"><![CDATA[4]]></parameter>


Indentation in the scala code is all 2 spaces - should we set the tab size to 2 as well?

vitillo · 2017-02-07T14:23:57Z

All done.

whd reviewed Jan 27, 2017

View reviewed changes

vitillo force-pushed the second branch 8 times, most recently from 9932be4 to e7baf9f Compare February 2, 2017 14:22

vitillo force-pushed the second branch from e7baf9f to 1ba6fca Compare February 2, 2017 14:46

vitillo requested a review from maurodoglio February 6, 2017 13:27

maurodoglio suggested changes Feb 6, 2017

View reviewed changes

mreid-moz reviewed Feb 6, 2017

View reviewed changes

vitillo force-pushed the second branch 6 times, most recently from 373987b to 3679330 Compare February 7, 2017 14:18

First commit.

e5ad631

vitillo force-pushed the second branch from 3679330 to e5ad631 Compare February 7, 2017 14:21

vitillo merged commit 1f0e1d1 into master Feb 7, 2017

vitillo self-assigned this Feb 10, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Move Dataset API from telemetry-batch-view to its own package on maven #1

Move Dataset API from telemetry-batch-view to its own package on maven #1

Uh oh!

vitillo commented Jan 27, 2017

Uh oh!

mreid-moz commented Jan 27, 2017

Uh oh!

whd Jan 27, 2017

Uh oh!

vitillo Jan 31, 2017 •

edited

Loading

Uh oh!

codecov-io commented Feb 2, 2017 •

edited

Loading

Uh oh!

maurodoglio left a comment

Uh oh!

mreid-moz commented Feb 6, 2017

Uh oh!

mreid-moz left a comment

Uh oh!

mreid-moz Feb 6, 2017

Uh oh!

mreid-moz Feb 6, 2017

Uh oh!

mreid-moz Feb 6, 2017

Uh oh!

mreid-moz Feb 6, 2017

Uh oh!

vitillo commented Feb 7, 2017

Uh oh!

Uh oh!

Move Dataset API from telemetry-batch-view to its own package on maven #1

Move Dataset API from telemetry-batch-view to its own package on maven #1

Uh oh!

Conversation

vitillo commented Jan 27, 2017

Uh oh!

mreid-moz commented Jan 27, 2017

Uh oh!

whd Jan 27, 2017

Choose a reason for hiding this comment

Uh oh!

vitillo Jan 31, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

codecov-io commented Feb 2, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

maurodoglio left a comment

Choose a reason for hiding this comment

Uh oh!

mreid-moz commented Feb 6, 2017

Uh oh!

mreid-moz left a comment

Choose a reason for hiding this comment

Uh oh!

mreid-moz Feb 6, 2017

Choose a reason for hiding this comment

Uh oh!

mreid-moz Feb 6, 2017

Choose a reason for hiding this comment

Uh oh!

mreid-moz Feb 6, 2017

Choose a reason for hiding this comment

Uh oh!

mreid-moz Feb 6, 2017

Choose a reason for hiding this comment

Uh oh!

vitillo commented Feb 7, 2017

Uh oh!

Uh oh!

vitillo Jan 31, 2017 •

edited

Loading

codecov-io commented Feb 2, 2017 •

edited

Loading