Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Confusion of answer pre-process #40

Open
guoyang9 opened this issue Dec 15, 2018 · 0 comments
Open

Confusion of answer pre-process #40

guoyang9 opened this issue Dec 15, 2018 · 0 comments

Comments

@guoyang9
Copy link

There are three questions confusing me, some of them may largely affect the final performance.

  1. When filtering answers, only the 'multiple_choice_answer' answer sets are pre-processed, as shown in this line, while most of the other answers remain pristine, not to mention the computation of answer occurrence.
  2. The input answer files to this function are the raw answers instead of the pre-processed answers. This could result in failure of finding answer index in processed answer set.
  3. Should we really need to do the process_digit_article? If processed in this way, some answers may become odd, for example, 'left one' will be 'left 1'. However, it brings a minor effect to the validation performance as we process the answers in both the training and validation sets. But for the testing set, further modification on code needs exploring (expanding the code to test phase).
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant