|
6 | 6 | "id": "Zo6cemRekqqm"
|
7 | 7 | },
|
8 | 8 | "source": [
|
9 |
| - "# Big Data for Engineers – Moodle\n", |
| 9 | + "# Big Data for Engineers – Moodle - Solutions\n", |
10 | 10 | "\n",
|
11 |
| - "# Spring 2024 - Week 11 - Rumble\n", |
| 11 | + "# Spring 2025 - Week 11 - Rumble\n", |
12 | 12 | "\n"
|
13 | 13 | ]
|
14 | 14 | },
|
|
26 | 26 | "metadata": {},
|
27 | 27 | "source": [
|
28 | 28 | "## 1. Install Java and Download Rumble\n",
|
29 |
| - "Please follow the instructions in Exercise11_Rumble_2024.ipynb." |
| 29 | + "Please follow the instructions in Exercise11_Rumble_2025.ipynb." |
30 | 30 | ]
|
31 | 31 | },
|
32 | 32 | {
|
|
78 | 78 | }
|
79 | 79 | },
|
80 | 80 | "collapsed": true,
|
81 |
| - "id": "YzHCdZ2Ld-kt" |
| 81 | + "id": "YzHCdZ2Ld-kt", |
| 82 | + "jupyter": { |
| 83 | + "outputs_hidden": true |
| 84 | + } |
82 | 85 | },
|
83 | 86 | "source": [
|
84 | 87 | "## 3. Prepare the Data: The Great Language Game\n",
|
|
107 | 110 | "For each quation, you have to submit the following results to the Moodle quiz:\n",
|
108 | 111 | "- The query you wrote\n",
|
109 | 112 | "- The query output __in the required format__ (which you will be graded on)\n",
|
110 |
| - "- The time it took you to run it (optional and not graded)\n", |
111 | 113 | "\n",
|
112 | 114 | "The execution time of the queries will be reported by Rumble."
|
113 | 115 | ]
|
|
123 | 125 | "cell_type": "markdown",
|
124 | 126 | "metadata": {},
|
125 | 127 | "source": [
|
126 |
| - "## Assignment 1\n", |
127 |
| - "Find all games such that the guessed language is correct (=target), and such that this language is Spanish. What is the length of the resulting sequence?" |
| 128 | + "### **Assignment 1** - Guessed German Wrong\n", |
| 129 | + "How many games have the target language as **German** but the user’s guess was incorrect? (`guess ≠ target`)\n", |
| 130 | + "\n", |
| 131 | + "**Answer Type:** Number" |
128 | 132 | ]
|
129 | 133 | },
|
130 | 134 | {
|
|
133 | 137 | "metadata": {},
|
134 | 138 | "outputs": [],
|
135 | 139 | "source": [
|
136 |
| - "%%jsoniq\n" |
| 140 | + "%%jsoniq" |
137 | 141 | ]
|
138 | 142 | },
|
139 | 143 | {
|
140 | 144 | "cell_type": "markdown",
|
141 | 145 | "metadata": {},
|
142 | 146 | "source": [
|
143 |
| - "## Assignment 2\n", |
144 |
| - "Find the number of all distinct values of the guessed languages (i.e. the guess field). What is the length of the resulting sequence?" |
| 147 | + "### **Assignment 2** - Games on First Day\n", |
| 148 | + "How many games were played on the **first day** in the dataset?\n", |
| 149 | + "\n", |
| 150 | + "**Answer Type:** Number" |
145 | 151 | ]
|
146 | 152 | },
|
147 | 153 | {
|
|
150 | 156 | "metadata": {},
|
151 | 157 | "outputs": [],
|
152 | 158 | "source": [
|
153 |
| - "%%jsoniq\n" |
| 159 | + "%%jsoniq" |
154 | 160 | ]
|
155 | 161 | },
|
156 | 162 | {
|
157 | 163 | "cell_type": "markdown",
|
158 | 164 | "metadata": {},
|
159 | 165 | "source": [
|
160 |
| - "## Assignment 3\n", |
161 |
| - "Return the top three games where the guessed language is incorrect ($\\neq$ target) ordered by country (ascending), then target language (ascending), then date (ascending). What is the sample id of the 3rd item in the list?\n", |
| 166 | + "### **Assignment 3** - Most Guessed Language\n", |
| 167 | + "What is the most frequently guessed language in the dataset?\n", |
162 | 168 | "\n",
|
163 |
| - "Enter it without quotes, for example 48f9c924e0d98c959d8a6f1862b3ce9a" |
| 169 | + "**Answer Type:** Short Answer (Language name)" |
164 | 170 | ]
|
165 | 171 | },
|
166 | 172 | {
|
|
169 | 175 | "metadata": {},
|
170 | 176 | "outputs": [],
|
171 | 177 | "source": [
|
172 |
| - "%%jsoniq\n" |
| 178 | + "%%jsoniq" |
173 | 179 | ]
|
174 | 180 | },
|
175 | 181 | {
|
176 | 182 | "cell_type": "markdown",
|
177 | 183 | "metadata": {},
|
178 | 184 | "source": [
|
179 |
| - "## Assignment 4\n", |
180 |
| - "Aggregate all games by guessed and target language, counting the number of guessing games that were done for each pair (guess, target). How many times has Dutch been mistaken for Norwegian (i.e. Dutch was the true answer)?" |
| 185 | + "### **Assignment 4** - Third Most Incorrect Country\n", |
| 186 | + "\n", |
| 187 | + "Find the **top three countries** where users made the highest number of **incorrect guesses**. What is the **third country** in this list?\n", |
| 188 | + "\n", |
| 189 | + "**Answer Type:** Short Answer (Country code, e.g., US, FR)" |
181 | 190 | ]
|
182 | 191 | },
|
183 | 192 | {
|
|
186 | 195 | "metadata": {},
|
187 | 196 | "outputs": [],
|
188 | 197 | "source": [
|
189 |
| - "%%jsoniq\n" |
| 198 | + "%%jsoniq" |
190 | 199 | ]
|
191 | 200 | },
|
192 | 201 | {
|
193 | 202 | "cell_type": "markdown",
|
194 | 203 | "metadata": {},
|
195 | 204 | "source": [
|
196 |
| - "## Assignment 5\n", |
197 |
| - "Among all the games where the guess was correct (=target), what is the percentage of cases where the second choice (among the array of possible answers) was the target?\n", |
| 205 | + "### **Assignment 5** - Guess Equals First Choice \n", |
198 | 206 | "\n",
|
199 |
| - "Please write the fraction rounding to 4 decimals (eg. 0.3323)" |
| 207 | + "How many games have the user’s guess equal to the **first** option in the choices array? \n", |
| 208 | + "\n", |
| 209 | + "**Answer Type:** Number" |
200 | 210 | ]
|
201 | 211 | },
|
202 | 212 | {
|
|
205 | 215 | "metadata": {},
|
206 | 216 | "outputs": [],
|
207 | 217 | "source": [
|
208 |
| - "%%jsoniq\n" |
| 218 | + "%%jsoniq" |
209 | 219 | ]
|
210 | 220 | },
|
211 | 221 | {
|
212 | 222 | "cell_type": "markdown",
|
213 | 223 | "metadata": {},
|
214 | 224 | "source": [
|
215 |
| - "## Assignment 6\n", |
216 |
| - "For each target language, compute the percentage of successful guess games (i.e. guess == target) relative to all games for that target language, and display the pairs (target_language, percentage) in descending order of the percentage. What is the third language in this list?" |
| 225 | + "### **Assignment 6** - Count Anomalous Games \n", |
| 226 | + " \n", |
| 227 | + "How many games have a target language that is **not** present in the list of choices? \n", |
| 228 | + "\n", |
| 229 | + "**Answer Type:** Number" |
217 | 230 | ]
|
218 | 231 | },
|
219 | 232 | {
|
|
222 | 235 | "metadata": {},
|
223 | 236 | "outputs": [],
|
224 | 237 | "source": [
|
225 |
| - "%%jsoniq\n" |
| 238 | + "%%jsoniq" |
226 | 239 | ]
|
227 | 240 | },
|
228 | 241 | {
|
229 | 242 | "cell_type": "markdown",
|
230 | 243 | "metadata": {},
|
231 | 244 | "source": [
|
232 |
| - "## Assignment 7\n", |
233 |
| - "How many games in France (country=FR) were played on the last day?" |
| 245 | + "### **Assignment 7** – Country with Highest Correct Ratio \n", |
| 246 | + "\n", |
| 247 | + "Which country has the highest ratio of correct guesses (i.e. games where guess equals target)? Return just the country code. \n", |
| 248 | + "\n", |
| 249 | + "**Answer Type:** Short Answer (Country code, e.g., US, FR)" |
234 | 250 | ]
|
235 | 251 | },
|
236 | 252 | {
|
|
239 | 255 | "metadata": {},
|
240 | 256 | "outputs": [],
|
241 | 257 | "source": [
|
242 |
| - "%%jsoniq\n" |
| 258 | + "%%jsoniq" |
243 | 259 | ]
|
244 | 260 | }
|
245 | 261 | ],
|
|
263 | 279 | "toc_visible": true
|
264 | 280 | },
|
265 | 281 | "kernelspec": {
|
266 |
| - "display_name": "Python 3", |
| 282 | + "display_name": "Python 3 (ipykernel)", |
267 | 283 | "language": "python",
|
268 | 284 | "name": "python3"
|
269 | 285 | },
|
|
277 | 293 | "name": "python",
|
278 | 294 | "nbconvert_exporter": "python",
|
279 | 295 | "pygments_lexer": "ipython3",
|
280 |
| - "version": "3.8.8" |
| 296 | + "version": "3.12.4" |
281 | 297 | },
|
282 | 298 | "toc": {
|
283 | 299 | "base_numbering": 1,
|
284 | 300 | "nav_menu": {},
|
285 |
| - "number_sections": true, |
| 301 | + "number_sections": false, |
286 | 302 | "sideBar": true,
|
287 | 303 | "skip_h1_title": false,
|
288 | 304 | "title_cell": "Table of Contents",
|
|
294 | 310 | }
|
295 | 311 | },
|
296 | 312 | "nbformat": 4,
|
297 |
| - "nbformat_minor": 1 |
| 313 | + "nbformat_minor": 4 |
298 | 314 | }
|
0 commit comments