
Commit 8adaf98

Update README.md
1 parent 6fa42ec commit 8adaf98

1 file changed

+16
-1
lines changed

README.md

+16-1
@@ -1,4 +1,19 @@
1-
# Categorize messages into "money credited" or "money debited" and extract the amounts mentioned in the messages based on their predicted categories.
1+
# Categorize texts into labels (categories) and detect spam or irrelevant text.
2+
3+
### Example:
4+
- Categorize messages into "money credited" or "money debited" and detect spam or irrelevant messages.
5+
6+
# Introduction
7+
## What does this AI model do?
8+
- You can build and train your AI model on your own dataset in `model.py`.
9+
- Your dataset should contain a list of [ text, label ] pairs.
10+
- After the model is trained and saved, you can enter your text in `runmodel.py`, which returns the label the text belongs to.
11+
12+
## Why would you need this?
13+
- Not only does this tell you which label your text belongs to, it also helps you identify whether the text is 'spam or irrelevant' by comparing it against the entries in the dataset.
14+
15+
## How does it identify if the text is irrelevant/spam?
16+
- It uses <a href="https://medium.com/@anurag-jain/tf-idf-vectorization-with-cosine-similarity-eca3386d4423">TF-IDF vectorization with cosine similarity</a>.
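
To illustrate the idea, here is a minimal sketch of TF-IDF vectors compared by cosine similarity, written with only the Python standard library. It is not the code in `model.py`: the function names, the whitespace tokenizer, the smoothed IDF formula, and the `threshold` value are all assumptions made for this example. A text is treated as irrelevant when its best similarity to any dataset entry falls below the threshold.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: lowercase and split on whitespace; a real pipeline may do more.
    return text.lower().split()


def build_idf(documents):
    # Smoothed inverse document frequency: log((1 + N) / (1 + df)) + 1
    n_docs = len(documents)
    doc_freq = Counter()
    for doc in documents:
        doc_freq.update(set(tokenize(doc)))
    return {term: math.log((1 + n_docs) / (1 + df)) + 1
            for term, df in doc_freq.items()}


def tfidf_vector(text, idf):
    # Terms never seen in the dataset have no IDF weight and are ignored.
    counts = Counter(t for t in tokenize(text) if t in idf)
    return {term: count * idf[term] for term, count in counts.items()}


def cosine_similarity(a, b):
    # Cosine of the angle between two sparse vectors stored as dicts.
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def is_irrelevant(text, documents, threshold=0.2):
    # Hypothetical rule: irrelevant if no dataset entry is similar enough.
    idf = build_idf(documents)
    query = tfidf_vector(text, idf)
    best = max((cosine_similarity(query, tfidf_vector(doc, idf))
                for doc in documents), default=0.0)
    return best < threshold


# Tiny illustrative dataset (made up for this example).
DATASET = [
    "Rs 500 credited to your account",
    "Rs 200 debited from your account",
]
```

With this sketch, `is_irrelevant("Win a free holiday now", DATASET)` is true because the text shares no terms with the dataset, while a message such as "Rs 300 credited to your account" is close to the first entry and is kept. The threshold would need tuning on real data.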
217

318
## 1. Gather Data Set and Labeling
419
- Gather a dataset (list of messages) and label the messages as credited or debited, then store the dataset as a CSV file.
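
As a rough sketch of this step, the snippet below writes a list of [ text, label ] pairs as CSV and reads it back. The column names `text` and `label` and the sample messages are assumptions for illustration; the layout expected by `model.py` may differ. An in-memory buffer stands in for the file so the example is self-contained.

```python
import csv
import io

# Made-up sample rows in [text, label] form.
rows = [
    ["Rs 500 credited to your account", "credited"],
    ["Rs 200 debited from your account", "debited"],
]


def write_dataset(rows, handle):
    # Write a header row followed by one row per labeled message.
    writer = csv.writer(handle)
    writer.writerow(["text", "label"])
    writer.writerows(rows)


def read_dataset(handle):
    # Read the rows back as [text, label] lists, using the header names.
    reader = csv.DictReader(handle)
    return [[row["text"], row["label"]] for row in reader]


buffer = io.StringIO()
write_dataset(rows, buffer)
buffer.seek(0)
loaded = read_dataset(buffer)
```

To store the dataset on disk instead, pass a file opened with `open(path, "w", newline="")` to `write_dataset`, where `path` is whatever file name you choose.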
