Reconstruction statements mentioned in the paper #30532

liumc14 · 2024-04-29T06:58:05Z

Hello, the dataset used here is SQuAD, but the three domain-specific datasets created in the paper do not seem to be reflected in the code. Also, in the three domain datasets released with the paper, the reconstruction statements appear to be identical in the source and target files. Why is this? @shamanez

transformers/examples/research_projects/rag-end2end-retriever/utils_rag.py

Line 62 in 73014b5

self.src_file = Path(data_dir).joinpath(type_path + ".source")

amyeroberts · 2024-04-29T08:05:05Z

Hi, thanks for raising an issue!

This question is best placed in our forums. We try to reserve GitHub issues for feature requests and bug reports.

shamanez · 2024-04-29T08:47:15Z

@liumc14, you are correct. I open-sourced this code before my paper. Also, to keep the architecture clean, I didn't add the reconstruction statement.

But it is pretty straightforward:

1. Mix the QA and reconstruction (Recon) data, with an identifier marking which type each example is.
2. Then, during the forward computation, use only the retrieved documents as the input to the generator when the training example belongs to the reconstruction signal.
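The first step above can be sketched as follows. This is only a minimal illustration, not code from the repository: the `Example` class, the `is_recon` flag used as the identifier, and the `mix_examples` helper are all hypothetical names. It assumes reconstruction statements are stored with the same text as source and target, as in the released files.

```python
import random
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Example:
    source: str      # text used to query the retriever
    target: str      # text the generator should produce
    is_recon: bool   # hypothetical identifier: True for reconstruction examples


def mix_examples(
    qa_pairs: List[Tuple[str, str]],
    recon_statements: List[str],
    seed: int = 0,
) -> List[Example]:
    """Merge QA pairs and reconstruction statements into one shuffled list."""
    examples = [Example(q, a, False) for q, a in qa_pairs]
    # A reconstruction statement is both the retrieval query and the target.
    examples += [Example(s, s, True) for s in recon_statements]
    random.Random(seed).shuffle(examples)
    return examples


mixed = mix_examples(
    qa_pairs=[("Who slammed the law?", "ACLU")],
    recon_statements=["ACLU of Arizona slam law."],
)
```

A boolean flag is just one way to carry the identifier; a prefix token in the source text would serve the same purpose.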

liumc14 · 2024-04-29T08:57:47Z

@shamanez But in the three domain-specific datasets from the download link you provided in the paper (https://drive.google.com/drive/folders/1up3yKcJFArBQ6e0F_6n_mfW1VPHxA20A), I found after downloading that each reconstruction statement in the training set's .source file is identical to the corresponding entry in .target, for example:
American Civil Liberties Union, ACLU of Arizona, National Immigration Law Center slam law.
American Civil Liberties Union, ACLU of Arizona, National Immigration Law Center slam law.
In this case, can the reconstruction statement still be used for training?

shamanez · 2024-04-30T00:31:21Z

Yes, the statement should be re-constructed. But the input to the generator should be the retrieved docs related to the statement.

liumc14 · 2024-04-30T00:42:13Z

@shamanez So training on reconstruction statements actually involves inputting the statement, retrieving related documents, and having the generator regenerate the statement from those documents? Thank you for your advice.

shamanez · 2024-04-30T02:19:44Z

Correct
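A rough sketch of the flow confirmed here, under stated assumptions: the function name `build_generator_inputs`, the `is_recon` flag, and the `" // "` separator between document and question are illustrative choices, not taken from the repository. For a reconstruction example the statement is used only for retrieval and as the target, so the generator sees the retrieved documents alone. For a QA example the question is kept alongside each document.

```python
from typing import List


def build_generator_inputs(
    query: str,
    retrieved_docs: List[str],
    is_recon: bool,
    sep: str = " // ",  # assumed separator between document and question
) -> List[str]:
    """Build one generator input string per retrieved document."""
    if is_recon:
        # Reconstruction: hide the statement, feed only the retrieved docs.
        return list(retrieved_docs)
    # QA: condition the generator on each document plus the question.
    return [doc + sep + query for doc in retrieved_docs]


docs = ["Doc about the Arizona immigration law.", "Doc about the ACLU."]
recon_inputs = build_generator_inputs("ACLU of Arizona slam law.", docs, is_recon=True)
qa_inputs = build_generator_inputs("Who slammed the law?", docs, is_recon=False)
```

Because the statement is withheld from the generator input, an identical .source and .target does not reduce the task to copying: the target has to be recovered from the retrieved documents.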

