graviti logo产品公开数据集关于我们
登录
197
0
5
NeutralizingBiasedText
创建来自Hello Dataset
概要
代码
活动

Overview

Concretely this means algorithms for

  • Identifying biased words in sentences.
  • Neutralizing bias in sentences.

firstpage

harvest/: Code for making the dataset. It works by crawling and filtering Wikipedia for bias-driven edits.

src/: Code for training models and using trained models to run inference. The models implemented here are referred to as MODULAR and CONCURRENT in the paper.

Instruction

These commands will download data and a pretrained model before running inference.

$ cd src/
$ python3 -m venv venv
$ source venv/bin/activate
$ pip install -r requirements.txt
$ python
>> import nltk; nltk.download("punkt")
$ sh download_data_ckpt_and_run_inference.sh

You can also run sh integration_test.sh to further verify that everything is installed correctly and working as it should be.

License

MIT

🎉感谢Hello Dataset的贡献
数据集信息
应用场景NLP
标注类型Text
任务类型暂无
LicenseMIT
更新时间2021-03-24 22:49:09
数据概要
数据格式Text
数据数量0
已标注数量0
文件大小105KB
版权归属方
Stanford University
标注方
未知
了解更多和支持
相关数据集
立即开始构建AI
免费开始联系我们