[PIPE2D-910] Update pfsDesignIds when subsetting pfsConfigs Created: 01/Oct/21  Updated: 13/Jan/22

Status: Open
Project: DRP 2-D Pipeline
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Story Priority: Normal
Reporter: rhl Assignee: price
Resolution: Unresolved Votes: 0
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Issue Links:
Blocks
is blocked by DAMD-122 Determine raw data distribution polic... In Progress
Relates
relates to PIPE2D-905 Restrict writing of pfsDesign/pfsConf... Done
Story Points: 1
Sprint: 2DDRP-2021 A 10, 2DDRP-2021 A11

 Description   

PIPE2D-905 restricts the ability to write subsets of pfsConfig/Design files, but I think that it'd be better to instead update the pfsDesignId and allow them to be written. It seems dangerous to have an incorrect pfsDesignId associated with data, and we do need to be able to write them to support open use.



 Comments   
Comment by price [ 01/Oct/21 ]

Changing the pfsDesignId would break the connection between the exposure (W_PFDSGN header) and the pfsDesign file.

Comment by price [ 16/Oct/21 ]

We've agreed that it's not clear exactly how we should handle all this. We think it might be important to be able to write pfsConfig subsets, e.g., for distributing censored raw data when users share fibers within the same exposure. We need input from NAOJ (e.g., Masayuki Tanaka) on how they envision data distribution of shared exposures will work.

One possibility is that when exporting data for shared exposures, we write a new "raw" exposure (manipulating pixel values to remove proprietary spectra, either by masking or subtracting the extracted spectra) with a new pfsDesignId (W_PFDSGN header) that maps to the modified pfsConfig. One thing to consider is flagging censored fibers (e.g., targetType=CENSORED and set target information to meaningless values) rather than removing them completely; I'm not sure what that gains, but it seems more honest and helpful about what's actually in the image.

No matter the exact method we use, it seems to me that data reduction based on censored raw data is always going to be inferior to the original raw data because of optical crosstalk. I hope that the pipeline-extracted spectra will be the primary output data product rather than the raw images.

Comment by Masayuki Tanaka [ 18/Oct/21 ]

NAOJ has not decided yet how we distribute raw data.  I think we do not want to modify the raw images.  My current thinking is that we replace the information of censored targets with NULL in pfsDesign and generate multiple pfsDesign files for all PIs in an exposure.  I know this will require modifications to the datamodel.  I have not discussed this yet with the STARS people and I think that is the first thing I should do.  And, what did you mean by optical crosstalk?  Overlapping spatial traces between neigboring fibers?  If so, can we extract all spectra and just not write censored fibers in the output?

Comment by price [ 19/Oct/21 ]

Yes, optical crosstalk is the overlap between neighbouring fibers in the spatial dimension. That means that the spectrum in fiber i is required to extract the spectrum from fiber i+1, and vice versa. That means that if users are going to extract spectra themselves, they need access to all spectra on the image in order to extract optimal quality spectra.

Created DAMD-122 to track the issue of raw data distribution policy, and made it a blocker.

Generated at Sat Feb 10 15:59:36 JST 2024 using Jira 8.3.4#803005-sha1:1f96e09b3c60279a408a2ae47be3c745f571388b.