Detecting cyberbullying in online content using Machine Learning & Deep Learning techniques.
- Introduction
- Features
- Installation
- Usage
- Datasets
- Evaluation Metrics
- Results
- Conclusion
- Future Work
- References
Cyberbullying is a growing concern in social media and online communities. This project leverages Machine Learning & Deep Learning models to detect cyberbullying across multiple data formats:
- Text using Bidirectional LSTM (Bi-LSTM)
- Images using Convolutional Neural Networks (CNN)
- Audio using Speech Recognition and NLP
The project aims to provide a high-accuracy cyberbullying detection system to ensure a safer online environment.
โ
Detection of cyberbullying in text, images, and audio
โ
Ensemble approach combining Bi-LSTM and CNN for better accuracy
โ
Multi-modal data support
โ
Performance evaluation using accuracy, precision, recall, and confusion matrix
โ
Potential extension to video-based analysis
Clone the repository and install dependencies:
git clone https://github.com/saiteja007-mv/Cyberbully-Detection-in-Texts-Images-and-Audios.git
cd Cyberbully-Detection-in-Texts-Images-and-Audios
pip install -r requirements.txtMake sure you have Jupyter Notebook installed:
pip install notebook1๏ธโฃ Open Jupyter Notebook
jupyter notebook2๏ธโฃ Navigate to the notebook files for text, image, and audio analysis
3๏ธโฃ Prepare your dataset with labeled cyberbullying and non-cyberbullying content
4๏ธโฃ Run the cells in the notebook to train the models
5๏ธโฃ View evaluation metrics and predictions
The models are evaluated based on:
- ๐ข Accuracy: Percentage of correctly classified instances
- ๐ Precision: Ratio of correctly predicted positive observations
- ๐ Recall: Proportion of actual positives correctly identified
- ๐ Confusion Matrix: Visual representation of true vs. predicted labels
- High accuracy achieved for text and image-based cyberbullying detection
- The ensemble approach (Bi-LSTM + CNN) improved precision & recall
- Future improvements can enhance audio analysis
This project demonstrates that deep learning models can effectively detect cyberbullying across different media types. The ensemble approach combining Bidirectional LSTM and CNN significantly improves accuracy. However, further enhancements are required for real-time video analysis.
๐น Integrate video analysis using object recognition and context analysis
๐น Enhance speech recognition for better audio-based detection
๐น Deploy as an API or web application for real-time cyberbullying detection
๐น Improve dataset diversity to enhance model generalization
๐ V. Subrahmanian and S. Kumar, Predicting human behavior: The next frontiers, Science, vol. 355, no. 6324, p. 489, 2017.
๐ H. Lauw, J. C. Shafer, R. Agrawal, and A. Ntoulas, Homophily in the digital world: A Live Journal case study, IEEE Internet Comput., vol. 14, no.2, pp. 15โ23, 2010.
๐ Arindam Mandal, Sriparna Saha, Alexandar Ferworn, A Hybrid Model for Cyberbullying Detection on Facebook.






