<!-- 
RSS generated by JIRA (8.3.4#803005-sha1:1f96e09b3c60279a408a2ae47be3c745f571388b) at Sat Feb 10 16:46:13 JST 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>PFS-JIRA</title>
    <link>https://pfspipe.ipmu.jp/jira</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>8.3.4</version>
        <build-number>803005</build-number>
        <build-date>13-09-2019</build-date>
    </build-info>


<item>
            <title>[SCIDB-68] prototype v2: directory structure of the 2d outputs</title>
                <link>https://pfspipe.ipmu.jp/jira/browse/SCIDB-68</link>
                <project id="10601" key="SCIDB">Science Database</project>
                    <description>&lt;p&gt;Come up with a tentative plan for the output directory structure.&lt;/p&gt;</description>
                <environment></environment>
        <key id="13107">SCIDB-68</key>
            <summary>prototype v2: directory structure of the 2d outputs</summary>
                <type id="10001" iconUrl="https://pfspipe.ipmu.jp/jira/secure/viewavatar?size=xsmall&amp;avatarId=10515&amp;avatarType=issuetype">Story</type>
                                            <priority id="10000" iconUrl="https://pfspipe.ipmu.jp/jira/images/icons/priorities/medium.svg">Normal</priority>
                        <status id="10002" iconUrl="https://pfspipe.ipmu.jp/jira/images/icons/statuses/generic.png" description="The issue is resolved, reviewed, and merged">Done</status>
                    <statusCategory id="3" key="done" colorName="green"/>
                                    <resolution id="10000">Done</resolution>
                                        <assignee username="hassan">hassan</assignee>
                                    <reporter username="msyktnk">Masayuki Tanaka</reporter>
                        <labels>
                    </labels>
                <created>Wed, 14 Nov 2018 05:02:33 +0000</created>
                <updated>Mon, 19 Nov 2018 23:53:11 +0000</updated>
                            <resolved>Mon, 19 Nov 2018 23:53:11 +0000</resolved>
                                                                        <due></due>
                            <votes>0</votes>
                                    <watches>3</watches>
                                                                <comments>
                            <comment id="14611" author="hassan" created="Wed, 14 Nov 2018 21:27:06 +0000"  >&lt;p&gt;I discussed this with the Princeton team. In short, the directory structure is specified by you.&lt;/p&gt;

&lt;p&gt;PFS scripts make use of the LSST Butler concept for input/output. All input data and output products are read from/written to what is known as a &apos;Butler Data Repository&apos;. On a file system, this corresponds to a single directory. For each PFS script called, this directory needs to be passed as the first argument.&lt;/p&gt;

&lt;p&gt;Below that directory, a number of subdirectories can be created, for calibs, and for output data. The names of those are also specified by the user when the PFS script in question is called. So you can decide the names of those too.&lt;/p&gt;

&lt;p&gt;See &lt;a href=&quot;https://pipelines.lsst.io/v/DM-11034/getting-started/data-setup.html&quot; class=&quot;external-link&quot; rel=&quot;nofollow&quot;&gt;https://pipelines.lsst.io/v/DM-11034/getting-started/data-setup.html&lt;/a&gt; as an example. I understand that &lt;a href=&quot;https://pfspipe.ipmu.jp/jira/secure/ViewProfile.jspa?name=sogo.mineo&quot; class=&quot;user-hover&quot; rel=&quot;sogo.mineo&quot;&gt;sogo.mineo&lt;/a&gt; is also aware of Butler concepts.&lt;/p&gt;

&lt;p&gt;Please drop me a comment or message me if I have misunderstood the request.&lt;/p&gt;

</comment>
                            <comment id="14612" author="price" created="Wed, 14 Nov 2018 21:37:08 +0000"  >&lt;p&gt;The directory structure we&apos;re currently using is in specified in the &lt;a href=&quot;https://github.com/Subaru-PFS/obs_pfs/blob/master/policy/PfsMapper.yaml&quot; class=&quot;external-link&quot; rel=&quot;nofollow&quot;&gt;PfsMapper.yaml&lt;/a&gt; file; I think this is a reasonable starting point.&lt;/p&gt;</comment>
                            <comment id="14614" author="msyktnk" created="Thu, 15 Nov 2018 07:13:53 +0000"  >&lt;p&gt;OK.  I think one possible option might be to have directories like:&lt;br/&gt;
&quot;survey_sim/v%02d/%05d/%s/&quot; % (simulation_version, tract,patch)&lt;br/&gt;
and put the psfObject files there:&lt;br/&gt;
&quot;pfsObject-%05d-%s-%3d-%08x-%02d-0x%08x.fits&quot; % (tract, patch, catId, objId, nVisit % 100, pfsVisitHash)&lt;br/&gt;
Does this look reasonable?&lt;/p&gt;</comment>
                            <comment id="14615" author="price" created="Thu, 15 Nov 2018 21:25:09 +0000"  >&lt;p&gt;The &lt;a href=&quot;https://github.com/Subaru-PFS/datamodel/blob/master/datamodel.txt#L302-L305&quot; class=&quot;external-link&quot; rel=&quot;nofollow&quot;&gt;datamodel&lt;/a&gt; currently specifies:&lt;/p&gt;

&lt;blockquote&gt;
&lt;div class=&quot;code panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;codeContent panelContent&quot;&gt;
&lt;pre class=&quot;code-java&quot;&gt;
     &lt;span class=&quot;code-quote&quot;&gt;&quot;pfsObject-%05d-%s-%3d-%08x-%02d-0x%08x.fits&quot;&lt;/span&gt; % (tract, patch, catId, objId, nVisit % 100, pfsVisitHash)
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;The path would be&lt;/p&gt;
&lt;div class=&quot;code panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;codeContent panelContent&quot;&gt;
&lt;pre class=&quot;code-java&quot;&gt;
   tract/patch/pfsObject-*.fits
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;&lt;/blockquote&gt;

&lt;p&gt;Of course, this is under a &lt;tt&gt;rerun&lt;/tt&gt; directory within the data repository, so the full path would be something like:&lt;/p&gt;
&lt;div class=&quot;code panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;codeContent panelContent&quot;&gt;
&lt;pre class=&quot;code-java&quot;&gt;
/path/to/dataRepo/rerun/&amp;lt;rerunName&amp;gt;/&amp;lt;tract&amp;gt;/&amp;lt;patch&amp;gt;/pfsObject-&amp;lt;stuff&amp;gt;.fits
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;Please note that &lt;b&gt;this is subject to change&lt;/b&gt;, for a couple of reasons:&lt;br/&gt;
1. There&apos;s no leading &lt;tt&gt;pfsObject&lt;/tt&gt; element in the directory structure separating this data type from others, unlike for products like &lt;a href=&quot;https://github.com/Subaru-PFS/obs_pfs/blob/master/policy/PfsMapper.yaml#L129&quot; class=&quot;external-link&quot; rel=&quot;nofollow&quot;&gt;pfsArm&lt;/a&gt;; I expect this is an oversight that we will remedy, and the reason that we haven&apos;t yet is simply because we don&apos;t yet have a component in the 2D pipeline that writes pfsObject files so the problem hasn&apos;t been noticed before now.&lt;br/&gt;
2. I expect the datamodel to change a bit as a result of the 2D pipeline design that is currently under internal review.&lt;/p&gt;

&lt;p&gt;Nevertheless, if you use the data butler to read and write the data products, then the need to know the precise directory structure vanishes, and the work required to adapt to changes is small, if any.&lt;/p&gt;</comment>
                            <comment id="14619" author="msyktnk" created="Mon, 19 Nov 2018 05:40:02 +0000"  >&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt; /path/to/dataRepo/rerun/&amp;lt;rerunName&amp;gt;/&amp;lt;tract&amp;gt;/&amp;lt;patch&amp;gt;/pfsObject-&amp;lt;stuff&amp;gt;.fits &lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;OK, I think we should follow this. I would propose that we use a leading word &apos;sim_&apos; for simulation reruns. For example, in our case:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt; /path/to/dataRepo/rerun/sim_prototype_v2/&amp;lt;tract&amp;gt;/&amp;lt;patch&amp;gt;/pfsObject-&amp;lt;stuff&amp;gt;.fits &lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;This would allow us to distinguish between real runs from simulated ones. I understand that the data model is subject for changes (e.g., I would expect something like &apos;deepCoadd/&apos; somewhere in the directory structure), but is there a way to indicate which rerun is based on which version of the data model?&#160; If not, perhaps we should invent one?&lt;/p&gt;</comment>
                            <comment id="14620" author="price" created="Mon, 19 Nov 2018 14:43:21 +0000"  >&lt;p&gt;You can encode whatever information into the rerun name you want, but I would advise against making it complicated as the rerun name is based on policy only, so it&apos;s not guaranteed to be always be what you want.&lt;/p&gt;

&lt;p&gt;The LSST code writes the version information as the &lt;tt&gt;packages&lt;/tt&gt; dataset. So it&apos;s not possible to tell the version that was used simply by looking at the directory structure, but it requires a little bit of python code.&lt;/p&gt;</comment>
                            <comment id="14622" author="msyktnk" created="Mon, 19 Nov 2018 23:52:54 +0000"  >&lt;p&gt;We will adopt&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt; /path/to/dataRepo/rerun/sim_prototype_v2/&amp;lt;tract&amp;gt;/&amp;lt;patch&amp;gt;/pfsObject-&amp;lt;stuff&amp;gt;.fits&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;&#160;&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                            <customfield id="customfield_10500" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10006" key="com.pyxis.greenhopper.jira:gh-epic-link">
                        <customfieldname>Epic Link</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>SCIDB-64</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                <customfield id="customfield_10010" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>0|s001q0:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                        </customfields>
    </item>
</channel>
</rss>