
Commit 5bf3b11

Author: Ankush Khanna
Adding datset md with cab columns and zone look up.
1 parent 3b0668f commit 5bf3b11

2 files changed, +58 -1 lines changed

README.md

+1-1
@@ -1,6 +1,6 @@
 # Data Engineering Zoomcamp
 
-dataset: taxi rides NY dataset
+[dataset: taxi rides NY dataset](dataset.md)
 
 Running use cases:
 

dataset.md

+57
@@ -0,0 +1,57 @@
[Medium article](https://medium.com/@NYCTLC/what-makes-a-city-street-smart-23496d92f60d)

[Trip record user guide](https://www1.nyc.gov/assets/tlc/downloads/pdf/trip_record_user_guide.pdf)

The dataset is divided into four parts:

- Yellow cabs
- Green cabs
- For-hire vehicles
- High-volume for-hire vehicles

Below I concentrate only on yellow and green cabs.

### Yellow and green cabs

| Columns               | Definition | Example             |
| --------------------- | ---------- | ------------------- |
| VendorID              |            | 2                   |
| lpep_pickup_datetime  |            | 2021-01-01 00:15:56 |
| lpep_dropoff_datetime |            | 2021-01-01 00:19:52 |
| store_and_fwd_flag    |            | N                   |
| RatecodeID            |            | 1                   |
| PULocationID          |            | 43                  |
| DOLocationID          |            | 151                 |
| passenger_count       |            | 1                   |
| trip_distance         |            | 1.01                |
| fare_amount           |            | 5.5                 |
| extra                 |            | 0.5                 |
| mta_tax               |            | 0.5                 |
| tip_amount            |            | 0                   |
| tolls_amount          |            | 0                   |
| ehail_fee             |            |                     |
| improvement_surcharge |            | 0.3                 |
| total_amount          |            | 6.8                 |
| payment_type          |            | 2                   |
| trip_type             |            | 1                   |
| congestion_surcharge  |            | 0                   |

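As a rough illustration of how the columns in the table above relate to each other, the sketch below rebuilds the example row in Python and derives two values from it: the trip duration from the two timestamps, and the sum of the individual charge columns. The helper names (`trip_duration_seconds`, `charges_total`) are hypothetical. Treating `total_amount` as the plain sum of the charge columns is an assumption that happens to hold for this example row, not a definition taken from the TLC documentation. The `lpep_` prefix suggests the example row comes from the green cab data; yellow cab files use `tpep_`-prefixed timestamp columns instead.

```python
from datetime import datetime
from decimal import Decimal

# Column names and example values copied from the table above.
COLUMNS = [
    "VendorID", "lpep_pickup_datetime", "lpep_dropoff_datetime",
    "store_and_fwd_flag", "RatecodeID", "PULocationID", "DOLocationID",
    "passenger_count", "trip_distance", "fare_amount", "extra", "mta_tax",
    "tip_amount", "tolls_amount", "ehail_fee", "improvement_surcharge",
    "total_amount", "payment_type", "trip_type", "congestion_surcharge",
]
VALUES = [
    "2", "2021-01-01 00:15:56", "2021-01-01 00:19:52", "N", "1", "43", "151",
    "1", "1.01", "5.5", "0.5", "0.5", "0", "0", "", "0.3", "6.8", "2", "1", "0",
]

# Assumption: these are the columns that add up to total_amount.
CHARGE_COLUMNS = [
    "fare_amount", "extra", "mta_tax", "tip_amount", "tolls_amount",
    "ehail_fee", "improvement_surcharge", "congestion_surcharge",
]

TIMESTAMP_FORMAT = "%Y-%m-%d %H:%M:%S"


def parse_amount(value: str) -> Decimal:
    """Parse a money column, treating an empty cell (e.g. ehail_fee) as zero."""
    return Decimal(value) if value.strip() else Decimal("0")


def trip_duration_seconds(row: dict) -> int:
    """Seconds between pickup and dropoff."""
    pickup = datetime.strptime(row["lpep_pickup_datetime"], TIMESTAMP_FORMAT)
    dropoff = datetime.strptime(row["lpep_dropoff_datetime"], TIMESTAMP_FORMAT)
    return int((dropoff - pickup).total_seconds())


def charges_total(row: dict) -> Decimal:
    """Sum of the individual charge columns."""
    return sum((parse_amount(row[c]) for c in CHARGE_COLUMNS), Decimal("0"))


row = dict(zip(COLUMNS, VALUES))
print(trip_duration_seconds(row))  # 236
print(charges_total(row))          # 6.8
```

`Decimal` is used instead of `float` so that the comparison against `total_amount` is exact; with floats, `5.5 + 0.5 + 0.5 + 0.3` may not compare equal to `6.8`.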
### Taxi zone lookup

| Columns      | Definition | Example        |
| ------------ | ---------- | -------------- |
| LocationID   |            | 1              |
| Borough      |            | EWR            |
| Zone         |            | Newark Airport |
| service_zone |            | EWR            |

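The `PULocationID` and `DOLocationID` columns of a trip record presumably refer to `LocationID` values in this lookup table. A minimal sketch of resolving an ID to a zone name follows. The `Zone` type and the `load_zones` and `zone_name` helpers are hypothetical, and the in-memory CSV text is a stand-in that holds only the example row shown above, so any other ID resolves to `None` here.

```python
import csv
import io
from typing import Dict, NamedTuple, Optional


class Zone(NamedTuple):
    location_id: int
    borough: str
    zone: str
    service_zone: str


# Hypothetical stand-in for the lookup file: only the example row from the table.
ZONE_LOOKUP_CSV = (
    "LocationID,Borough,Zone,service_zone\n"
    "1,EWR,Newark Airport,EWR\n"
)


def load_zones(text: str) -> Dict[int, Zone]:
    """Index the lookup rows by LocationID."""
    zones: Dict[int, Zone] = {}
    for record in csv.DictReader(io.StringIO(text)):
        zone = Zone(
            location_id=int(record["LocationID"]),
            borough=record["Borough"],
            zone=record["Zone"],
            service_zone=record["service_zone"],
        )
        zones[zone.location_id] = zone
    return zones


def zone_name(zones: Dict[int, Zone], location_id: int) -> Optional[str]:
    """Return the zone name for an ID, or None if the ID is not in the lookup."""
    zone = zones.get(location_id)
    return zone.zone if zone else None


zones = load_zones(ZONE_LOOKUP_CSV)
print(zone_name(zones, 1))   # Newark Airport
print(zone_name(zones, 43))  # None (not present in this one-row stand-in)
```

Returning `None` for unknown IDs keeps the join tolerant of trip records whose location IDs are missing from the lookup; whether such records should be dropped or kept is left open here.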
[Shapefile from S3](https://s3.amazonaws.com/nyctlc/misc/taxi_zones.zip)

[Taxi zones](https://data.cityofnewyork.us/Transportation/NYC-Taxi-Zones/d3c5-ddgc)
