File tree 2 files changed +58
-1
lines changed
2 files changed +58
-1
lines changed Original file line number Diff line number Diff line change 1
1
# Data Engineering Zoomcamp
2
2
3
- dataset: taxi rides NY dataset
3
+ [ dataset: taxi rides NY dataset] ( dataset.md )
4
4
5
5
Running use cases:
6
6
Original file line number Diff line number Diff line change
1
+ [ Medium article] ( https://medium.com/@NYCTLC/what-makes-a-city-street-smart-23496d92f60d )
2
+
3
+ [ Trip record user guide] ( https://www1.nyc.gov/assets/tlc/downloads/pdf/trip_record_user_guide.pdf )
4
+
5
+ The data set is divided into 4 parts:
6
+
7
+ - Yellow cabs
8
+ - Green cabs
9
+ - For Hire Vehicles
10
+ - High volume for hire vehicles
11
+
12
+
13
+
14
+ Below I am only concentrating on Yellow and green cabs
15
+
16
+ ### Yellow and green cabs
17
+
18
+ ,
19
+
20
+ | Columns | Definition | Example |
21
+ | --------------------- | ---------- | ------------------- |
22
+ | VendorID | | 2 |
23
+ | lpep_pickup_datetime | | 2021-01-01 00:15:56 |
24
+ | lpep_dropoff_datetime | | 2021-01-01 00:19:52 |
25
+ | store_and_fwd_flag | | N, |
26
+ | RatecodeID | | 1 |
27
+ | PULocationID | | 43 |
28
+ | DOLocationID | | 151 |
29
+ | passenger_count | | 1 |
30
+ | trip_distance | | 1.01 |
31
+ | fare_amount | | 5.5 |
32
+ | extra | | 0.5 |
33
+ | mta_tax | | 0.5 |
34
+ | tip_amount | | 0 |
35
+ | tolls_amount | | 0 |
36
+ | ehail_fee | | |
37
+ | improvement_surcharge | | 0.3 |
38
+ | total_amount | | 6.8 |
39
+ | payment_type | | 2 |
40
+ | trip_type | | 1 |
41
+ | congestion_surcharge | | 0 |
42
+
43
+
44
+
45
+ ### Taxi zone Loopup
46
+
47
+ | Columns | Definition | Example |
48
+ | ------------ | ---------- | -------------- |
49
+ | LocationID | | 1 |
50
+ | Borough | | EWR |
51
+ | Zone | | Newark Airport |
52
+ | service_zone | | EWR |
53
+
54
+ [ Shapefile from S3] ( https://s3.amazonaws.com/nyctlc/misc/taxi_zones.zip )
55
+
56
+ [ Taxi zones] ( https://data.cityofnewyork.us/Transportation/NYC-Taxi-Zones/d3c5-ddgc )
57
+
You can’t perform that action at this time.
0 commit comments