You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
FLoRA now has preprint-publication deduplication (added for #105) that:
Detects when the same paper appears as both preprint and published version (different DOIs)
Resolves confirmed duplicates, keeping the published version and storing the alternative DOI in doi_o_alt / doi_r_alt columns
Handles both replication-side (same doi_o, different doi_r) and original-side (different doi_o for the same paper) duplicates
What needs to happen
Extend dedup to FReD: The same preprint-publication detection logic (R/preprint_dedup.R) should be applied to the FReD effect-level dataset, not just the paper-level FLoRA dataset.
Add alternative DOI columns even without duplicates: Where FReD references a DOI that has a known preprint/published counterpart (from the FLoRA confirmed duplicates or CrossRef metadata), doi_o_alt and doi_r_alt should be populated even if FReD only contains one version. This ensures users can look up papers by either DOI.
References
Preprint dedup logic: R/preprint_dedup.R
FLoRA pipeline integration: Step 7c in pipelines/flora/prepare_flora.qmd
Context
FLoRA now has preprint-publication deduplication (added for #105) that:
doi_o_alt/doi_r_altcolumnsdoi_o, differentdoi_r) and original-side (differentdoi_ofor the same paper) duplicatesWhat needs to happen
Extend dedup to FReD: The same preprint-publication detection logic (
R/preprint_dedup.R) should be applied to the FReD effect-level dataset, not just the paper-level FLoRA dataset.Add alternative DOI columns even without duplicates: Where FReD references a DOI that has a known preprint/published counterpart (from the FLoRA confirmed duplicates or CrossRef metadata),
doi_o_altanddoi_r_altshould be populated even if FReD only contains one version. This ensures users can look up papers by either DOI.References
R/preprint_dedup.Rpipelines/flora/prepare_flora.qmdcache/confirmed_preprint_duplicates.csv