Merge branch 'main' into main

capjamesg · web-flow · commit 6ec985120d17 · 2024-11-26T16:49:30.000Z
diff --git a/.gitattributes b/.gitattributes
@@ -0,0 +1 @@
+*.ipynb linguist-vendored
diff --git a/CONTRIBUTING.md b/CONTRIBUTING.md
@@ -0,0 +1,16 @@
+## 🦸 Contributing to awesome-openai-vision-api-experiments 
+
+We love your input! We want to make contributing to awesome-openai-vision-api-experiments as easy and transparent as possible, whether it's:
+
+- Reporting a bug
+- Discussing the current state of the code
+- Submitting a fix
+
+## 🧪️ Adding a new experiment
+
+- **We only accept experiments where the code was open-sourced.**
+- Add new subdirectory to `experiments` directory.
+- Add new entry to `automation/data.csv` file.
+- Run `automation/script.py`. Experiments table in `README.md` will update 
+automatically.
+- Commit changes to feature branch. Create PR.
diff --git a/README.md b/README.md
@@ -14,8 +14,8 @@ Experimenting with the OpenAI API requires an API 🔑. You can get one
 
 ## ⚠️ Limitations
 
-- 100 API requests per single API key per day
-- Can't be used for object detection or image segmentation
+- 100 API requests per single API key per day.
+- Can't be used for object detection or image segmentation. We can solve this problem by combining GPT-4V with foundational models like GroundingDINO or Segment Anything (SAM). Please take a look at the [example](https://github.com/roboflow/awesome-openai-vision-api-experiments/tree/main/experiments/gpt4v-grounding-dino-detection) and read our [blog post](https://blog.roboflow.com/dino-gpt-4v).
 
 ## 🧪 Experiments
 
@@ -32,7 +32,10 @@ Experimenting with the OpenAI API requires an API 🔑. You can get one
 | zero-shot object detection with GroundingDINO + GPT-4V | [![GitHub](https://badges.aleen42.com/src/github.svg)](https://github.com/roboflow/awesome-openai-vision-api-experiments/tree/main/experiments/gpt4v-grounding-dino-detection) [![Gradio](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Spaces-blue)](https://huggingface.co/spaces/Roboflow/DINO-GPT4V)  | @capjamesg |
 | GPT-4V vs. CLIP | [![GitHub](https://badges.aleen42.com/src/github.svg)](https://github.com/roboflow/awesome-openai-vision-api-experiments/tree/main/experiments/gpt4v-vs-clip)   | @capjamesg |
 | GPT-4V with Set-of-Mark (SoM) | [![GitHub](https://badges.aleen42.com/src/github.svg)](https://github.com/microsoft/SoM)   | Jianwei Yang, Hao Zhang, Feng Li, Xueyan Zou, Chunyuan Li, Jianfeng Gao |
-<!--- AUTOGENERATED_EXPERIMENTS_LIST -->
+| GPT-4V on Web | [![GitHub](https://badges.aleen42.com/src/github.svg)](https://github.com/Jiayi-Pan/GPT-V-on-Web)   | @Jiayi-Pan |
+| automated voiceover of NBA game | [![GitHub](https://badges.aleen42.com/src/github.svg)](https://github.com/roboflow/awesome-openai-vision-api-experiments/tree/main/experiments/automated-voiceover-of-nba-game)  [![Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/roboflow/awesome-openai-vision-api-experiments/blob/main/experiments/automated-voiceover-of-nba-game/notebook.ipynb) | @SkalskiP |
+| screenshot-to-code | [![GitHub](https://badges.aleen42.com/src/github.svg)](https://github.com/abi/screenshot-to-code)   | @abi |
+| GPT with Vision Checkup | [![GitHub](https://badges.aleen42.com/src/github.svg)]( https://github.com/roboflow/gpt-checkup)   |  Roboflow team |
 
 https://github.com/roboflow/awesome-openai-vision-api-experiments/assets/26109316/c63fa3c0-4564-49ee-8982-a9e6a23dae9b
 
@@ -44,8 +47,19 @@ by Jianwei Yang, Hao Zhang, Feng Li, Xueyan Zou, Chunyuan Li, Jianfeng Gao
 by Zhengyuan Yang, Linjie Li, Kevin Lin, Jianfeng Wang, Chung-Ching Lin, Zicheng Liu, Lijuan Wang
 - [GPT-4 System Card](https://cdn.openai.com/papers/gpt-4-system-card.pdf) by OpenAI
 
+## 🖊️ Blogs
+
+- [How CLIP and GPT-4V Compare for Classification](https://blog.roboflow.com/clip-vs-gpt-4v/)
+- [Experiments with GPT-4V for Object Detection](https://blog.roboflow.com/gpt-4v-object-detection/)
+- [Distilling GPT-4 for Classification with an API](https://blog.roboflow.com/gpt-4-image-classification/)
+- [DINO-GPT4-V: Use GPT-4V in a Two-Stage Detection Model](https://blog.roboflow.com/dino-gpt-4v/)
+- [First Impressions with GPT-4V(ision)](https://blog.roboflow.com/gpt-4-vision/)
+
 ## 🦸 Contribution
-I would love your help in making this repository even better! Whether you want to
-correct a typo, add some new experiment, or if you have any suggestions for improvement,
+
+We would love your help in making this repository even better! Whether you want to
+add a new experiment or have any suggestions for improvement,
 feel free to open an [issue](https://github.com/roboflow/awesome-openai-vision-api-experiments/issues)
 or [pull request](https://github.com/roboflow/awesome-openai-vision-api-experiments/pulls).
+
+If you are up to the task and want to add a new experiment, please look at our [contribution guide](https://github.com/roboflow/awesome-openai-vision-api-experiments/blob/main/CONTRIBUTING.md). There you can find all the information you need.
diff --git a/automation/data.csv b/automation/data.csv
@@ -5,4 +5,7 @@ title, code, huggingface, colab, authors
 "zero-shot object detection with GroundingDINO + GPT-4V","https://github.com/roboflow/awesome-openai-vision-api-experiments/tree/main/experiments/gpt4v-grounding-dino-detection","https://huggingface.co/spaces/Roboflow/DINO-GPT4V","",@capjamesg
 "GPT-4V vs. CLIP","https://github.com/roboflow/awesome-openai-vision-api-experiments/tree/main/experiments/gpt4v-vs-clip","","",@capjamesg
 "GPT-4V with Set-of-Mark (SoM)","https://github.com/microsoft/SoM","","","Jianwei Yang, Hao Zhang, Feng Li, Xueyan Zou, Chunyuan Li, Jianfeng Gao"
-"GPT-4V audio narration","https://github.com/roboflow/awesome-openai-vision-api-experiments/tree/main/experiments/gpt4v-narration","","",@etown
+"GPT-4V audio narration","https://github.com/roboflow/awesome-openai-vision-api-experiments/tree/main/experiments/gpt4v-narration","","",@etown
+"GPT-4V on Web","https://github.com/Jiayi-Pan/GPT-V-on-Web","","",@Jiayi-Pan
+"automated voiceover of NBA game","https://github.com/roboflow/awesome-openai-vision-api-experiments/tree/main/experiments/automated-voiceover-of-nba-game","","https://colab.research.google.com/github/roboflow/awesome-openai-vision-api-experiments/blob/main/experiments/automated-voiceover-of-nba-game/notebook.ipynb",@SkalskiP
+"GPT with Vision Checkup", https://github.com/roboflow/gpt-checkup,,, Roboflow team
diff --git a/experiments/automated-voiceover-of-nba-game/README.md b/experiments/automated-voiceover-of-nba-game/README.md
@@ -0,0 +1 @@
+## Automated voiceover of NBA game 🏀
diff --git a/experiments/automated-voiceover-of-nba-game/notebook.ipynb b/experiments/automated-voiceover-of-nba-game/notebook.ipynb

Original file line number	Diff line number	Diff line change
`@@ -0,0 +1 @@`
	`1`	`+## Automated voiceover of NBA game 🏀`