You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+16-1
Original file line number
Diff line number
Diff line change
@@ -1,4 +1,19 @@
1
-
# Categorize messages into "money credited" or "money debited" and extract the amounts mentioned in the messages based on their predicted categories.
1
+
# Categorize texts into labels(categories) and detect spam or irrelevent text.
2
+
3
+
### Example:
4
+
- Categorize messages into "money credited" or "money debited" and detect spam or irrelevent messages.
5
+
6
+
# Introduction
7
+
## What does this AI Model do ?
8
+
- You can train and build your AI Model using your own dataset in `model.py`.
9
+
- Your data set should contain list of [ text, labels ].
10
+
- After the model is trained and saved, you can try inputing your text in `runmodel.py` it would return what label it belongs to.
11
+
12
+
## Why would you need this ?
13
+
- Not only this tells you what label your text belongs to it also helps you identify if the text is 'spam or irrelevent' with each data in dataset.
14
+
15
+
## How does it identify if the text is irrelevent/spam ?
16
+
- It uses <ahref="https://medium.com/@anurag-jain/tf-idf-vectorization-with-cosine-similarity-eca3386d4423">TF-IDFVectorization with Cosine Similarity</a>
2
17
3
18
## 1. Gather Data Set and Labeling
4
19
- Gather a dataset (list of messages) and label the messages as credited or debited, then store the dataset as a CSV file.
0 commit comments