
Commit 6ec9851

Merge branch 'main' into main
2 parents 7830b55 + 9313bf7 commit 6ec9851

6 files changed, with 534 additions and 6 deletions

.gitattributes

Lines changed: 1 addition & 0 deletions
@@ -0,0 +1 @@
+*.ipynb linguist-vendored

CONTRIBUTING.md

Lines changed: 16 additions & 0 deletions
@@ -0,0 +1,16 @@
+## 🦸 Contributing to awesome-openai-vision-api-experiments
+
+We love your input! We want to make contributing to awesome-openai-vision-api-experiments as easy and transparent as possible, whether it's:
+
+- Reporting a bug
+- Discussing the current state of the code
+- Submitting a fix
+
+## 🧪️ Adding a new experiment
+
+- **We only accept experiments where the code was open-sourced.**
+- Add new subdirectory to `experiments` directory.
+- Add new entry to `automation/data.csv` file.
+- Run `automation/script.py`. Experiments table in `README.md` will update
+automatically.
+- Commit changes to feature branch. Create PR.
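The contents of `automation/script.py` are not part of this commit, so the following is only a sketch of how one `automation/data.csv` entry could be turned into a row of the experiments table. The badge image URLs and the column order are copied from the `README.md` rows in this diff; the helper names `badge` and `render_row` are hypothetical and are not taken from the repository.

```python
# Hypothetical sketch: render one data.csv entry as a README table row.
# Badge image URLs are the ones visible in the README.md hunk of this commit.
GITHUB_BADGE = "https://badges.aleen42.com/src/github.svg"
GRADIO_BADGE = "https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Spaces-blue"
COLAB_BADGE = "https://colab.research.google.com/assets/colab-badge.svg"


def badge(label, image_url, target_url):
    """Return a Markdown badge link, or an empty string when there is no target."""
    target_url = target_url.strip()
    if not target_url:
        return ""
    return f"[![{label}]({image_url})]({target_url})"


def render_row(title, code, huggingface="", colab="", authors=""):
    """Build a '| title | badges | authors |' row from the five CSV columns."""
    badges = [
        badge("GitHub", GITHUB_BADGE, code),
        badge("Gradio", GRADIO_BADGE, huggingface),
        badge("Colab", COLAB_BADGE, colab),
    ]
    # Skip badges for empty columns and separate the rest with single spaces.
    links = " ".join(b for b in badges if b)
    return f"| {title.strip()} | {links} | {authors.strip()} |"


print(render_row(
    "GPT-4V on Web",
    "https://github.com/Jiayi-Pan/GPT-V-on-Web",
    authors="@Jiayi-Pan",
))
```

Stripping each field before rendering is a design choice of this sketch: it keeps stray spaces in the CSV from leaking into the generated Markdown links.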

README.md

Lines changed: 19 additions & 5 deletions
@@ -14,8 +14,8 @@ Experimenting with the OpenAI API requires an API 🔑. You can get one
 
 ## ⚠️ Limitations
 
-- 100 API requests per single API key per day
-- Can't be used for object detection or image segmentation
+- 100 API requests per single API key per day.
+- Can't be used for object detection or image segmentation. We can solve this problem by combining GPT-4V with foundational models like GroundingDINO or Segment Anything (SAM). Please take a look at the [example](https://github.com/roboflow/awesome-openai-vision-api-experiments/tree/main/experiments/gpt4v-grounding-dino-detection) and read our [blog post](https://blog.roboflow.com/dino-gpt-4v).
 
 ## 🧪 Experiments
 
@@ -32,7 +32,10 @@ Experimenting with the OpenAI API requires an API 🔑. You can get one
 | zero-shot object detection with GroundingDINO + GPT-4V | [![GitHub](https://badges.aleen42.com/src/github.svg)](https://github.com/roboflow/awesome-openai-vision-api-experiments/tree/main/experiments/gpt4v-grounding-dino-detection) [![Gradio](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Spaces-blue)](https://huggingface.co/spaces/Roboflow/DINO-GPT4V) | @capjamesg |
 | GPT-4V vs. CLIP | [![GitHub](https://badges.aleen42.com/src/github.svg)](https://github.com/roboflow/awesome-openai-vision-api-experiments/tree/main/experiments/gpt4v-vs-clip) | @capjamesg |
 | GPT-4V with Set-of-Mark (SoM) | [![GitHub](https://badges.aleen42.com/src/github.svg)](https://github.com/microsoft/SoM) | Jianwei Yang, Hao Zhang, Feng Li, Xueyan Zou, Chunyuan Li, Jianfeng Gao |
-<!--- AUTOGENERATED_EXPERIMENTS_LIST -->
+| GPT-4V on Web | [![GitHub](https://badges.aleen42.com/src/github.svg)](https://github.com/Jiayi-Pan/GPT-V-on-Web) | @Jiayi-Pan |
+| automated voiceover of NBA game | [![GitHub](https://badges.aleen42.com/src/github.svg)](https://github.com/roboflow/awesome-openai-vision-api-experiments/tree/main/experiments/automated-voiceover-of-nba-game) [![Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/roboflow/awesome-openai-vision-api-experiments/blob/main/experiments/automated-voiceover-of-nba-game/notebook.ipynb) | @SkalskiP |
+| screenshot-to-code | [![GitHub](https://badges.aleen42.com/src/github.svg)](https://github.com/abi/screenshot-to-code) | @abi |
+| GPT with Vision Checkup | [![GitHub](https://badges.aleen42.com/src/github.svg)]( https://github.com/roboflow/gpt-checkup) | Roboflow team |
 
 https://github.com/roboflow/awesome-openai-vision-api-experiments/assets/26109316/c63fa3c0-4564-49ee-8982-a9e6a23dae9b
 
@@ -44,8 +47,19 @@ by Jianwei Yang, Hao Zhang, Feng Li, Xueyan Zou, Chunyuan Li, Jianfeng Gao
 by Zhengyuan Yang, Linjie Li, Kevin Lin, Jianfeng Wang, Chung-Ching Lin, Zicheng Liu, Lijuan Wang
 - [GPT-4 System Card](https://cdn.openai.com/papers/gpt-4-system-card.pdf) by OpenAI
 
+## 🖊️ Blogs
+
+- [How CLIP and GPT-4V Compare for Classification](https://blog.roboflow.com/clip-vs-gpt-4v/)
+- [Experiments with GPT-4V for Object Detection](https://blog.roboflow.com/gpt-4v-object-detection/)
+- [Distilling GPT-4 for Classification with an API](https://blog.roboflow.com/gpt-4-image-classification/)
+- [DINO-GPT4-V: Use GPT-4V in a Two-Stage Detection Model](https://blog.roboflow.com/dino-gpt-4v/)
+- [First Impressions with GPT-4V(ision)](https://blog.roboflow.com/gpt-4-vision/)
+
 ## 🦸 Contribution
-I would love your help in making this repository even better! Whether you want to
-correct a typo, add some new experiment, or if you have any suggestions for improvement,
+
+We would love your help in making this repository even better! Whether you want to
+add a new experiment or have any suggestions for improvement,
 feel free to open an [issue](https://github.com/roboflow/awesome-openai-vision-api-experiments/issues)
 or [pull request](https://github.com/roboflow/awesome-openai-vision-api-experiments/pulls).
+
+If you are up to the task and want to add a new experiment, please look at our [contribution guide](https://github.com/roboflow/awesome-openai-vision-api-experiments/blob/main/CONTRIBUTING.md). There you can find all the information you need.

automation/data.csv

Lines changed: 4 additions & 1 deletion
@@ -5,4 +5,7 @@ title, code, huggingface, colab, authors
 "zero-shot object detection with GroundingDINO + GPT-4V","https://github.com/roboflow/awesome-openai-vision-api-experiments/tree/main/experiments/gpt4v-grounding-dino-detection","https://huggingface.co/spaces/Roboflow/DINO-GPT4V","",@capjamesg
 "GPT-4V vs. CLIP","https://github.com/roboflow/awesome-openai-vision-api-experiments/tree/main/experiments/gpt4v-vs-clip","","",@capjamesg
 "GPT-4V with Set-of-Mark (SoM)","https://github.com/microsoft/SoM","","","Jianwei Yang, Hao Zhang, Feng Li, Xueyan Zou, Chunyuan Li, Jianfeng Gao"
-"GPT-4V audio narration","https://github.com/roboflow/awesome-openai-vision-api-experiments/tree/main/experiments/gpt4v-narration","","",@etown
+"GPT-4V audio narration","https://github.com/roboflow/awesome-openai-vision-api-experiments/tree/main/experiments/gpt4v-narration","","",@etown
+"GPT-4V on Web","https://github.com/Jiayi-Pan/GPT-V-on-Web","","",@Jiayi-Pan
+"automated voiceover of NBA game","https://github.com/roboflow/awesome-openai-vision-api-experiments/tree/main/experiments/automated-voiceover-of-nba-game","","https://colab.research.google.com/github/roboflow/awesome-openai-vision-api-experiments/blob/main/experiments/automated-voiceover-of-nba-game/notebook.ipynb",@SkalskiP
+"GPT with Vision Checkup", https://github.com/roboflow/gpt-checkup,,, Roboflow team
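The last added row puts a space after each comma, as does the header line, while the other rows do not. The sketch below shows how Python's standard `csv` module reads both styles. It does not reproduce how the repository's own script loads the file, which this commit does not show. `SAMPLE` and `load_rows` are hypothetical names, and the sample rows are copied from the hunk above.

```python
import csv
import io

# Header line plus two rows copied from the data.csv hunk of this commit.
SAMPLE = '''title, code, huggingface, colab, authors
"GPT-4V on Web","https://github.com/Jiayi-Pan/GPT-V-on-Web","","",@Jiayi-Pan
"GPT with Vision Checkup", https://github.com/roboflow/gpt-checkup,,, Roboflow team
'''


def load_rows(text, skipinitialspace):
    """Parse CSV text into dicts keyed by the header line."""
    reader = csv.DictReader(io.StringIO(text), skipinitialspace=skipinitialspace)
    return list(reader)


# Default parsing keeps the space after each comma, in keys and in values.
strict = load_rows(SAMPLE, skipinitialspace=False)
# skipinitialspace=True drops whitespace that directly follows a delimiter.
lenient = load_rows(SAMPLE, skipinitialspace=True)

print(repr(strict[1][" code"]))
print(repr(lenient[1]["code"]))
```

With default parsing, the leading space stays in the URL value, which would be consistent with the `]( https://github.com/roboflow/gpt-checkup)` link in the README row for this entry.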
Lines changed: 1 addition & 0 deletions
@@ -0,0 +1 @@
+## Automated voiceover of NBA game 🏀

experiments/automated-voiceover-of-nba-game/notebook.ipynb

Lines changed: 493 additions & 0 deletions
Large diffs are not rendered by default.
