COMS 3997 NLP Final Project by Team: Amanda Lee Jenkins(alj2155), Meng Fan Wang(mw3751), Lorraine Xu(yx2950) Full Work in https://drive.google.com/drive/folders/1o27BHR4dDVF7RN2jEJg55CMJC07nvvME?usp=drive_link.
To get CNN data, visit this link: https://www.kaggle.com/datasets/gowrishankarp/newspaper-text-summarization-cnn-dailymail?resource=download. Download test.csv to your local computer. Then drag the file to the "Files" section on the left of your Jupyter Notebook.
To get Reddit data, visit this link: https://drive.google.com/file/d/1ffWfITKFMJeqjT8loC8aiCLRNJpc_XnF/view. Download tifu_all_tokenized_and_filtered.json to your local computer. Then drag the file to the "Files" section on the left of your Jupyter Notebook.