<!-- 
RSS generated by JIRA (8.3.4#803005-sha1:1f96e09b3c60279a408a2ae47be3c745f571388b) at Sat Feb 10 15:59:50 JST 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>PFS-JIRA</title>
    <link>https://pfspipe.ipmu.jp/jira</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>8.3.4</version>
        <build-number>803005</build-number>
        <build-date>13-09-2019</build-date>
    </build-info>


<item>
            <title>[PIPE2D-920] How to put the large set of the AMBRE model templates in github for flux calibration</title>
                <link>https://pfspipe.ipmu.jp/jira/browse/PIPE2D-920</link>
                <project id="10002" key="PIPE2D">DRP 2-D Pipeline</project>
                    <description>&lt;p&gt;Initially, we plan to store the 6k templates (2.4GB) in GitHub for flux calibration. Because this might be large, Mineo-san and Yamashita are discussing reducing the size.&lt;br/&gt;
 One option is to upload a small subset of them. This enables us to run a test code for the blue-part in flux calibration but the output is inaccurate. The full data will be uploaded sometime in the future.&lt;br/&gt;
 Another option is to compress the 6k templates. We are now discussing whether saving them in fixed-point numbers works well or not.&lt;br/&gt;
 &#160;&lt;br/&gt;
 We have ~6k templates. In the current plan, a user generates a large template set (5.6k) with smaller parameter grids for stellar typing using the 6k templates and RBF interpolation in the f_star repository, on a user-side. Stellar typing for flux calibration refers to the 5.6k templates. 1 template (1 FITS file) has ~400kB. The size of the first template set of 6k is 6k*400kB = 2.4 GB in total.&lt;br/&gt;
 &#160;&lt;/p&gt;</description>
                <environment></environment>
        <key id="18290">PIPE2D-920</key>
            <summary>How to put the large set of the AMBRE model templates in github for flux calibration</summary>
                <type id="10001" iconUrl="https://pfspipe.ipmu.jp/jira/secure/viewavatar?size=xsmall&amp;avatarId=10515&amp;avatarType=issuetype">Story</type>
                                            <priority id="10000" iconUrl="https://pfspipe.ipmu.jp/jira/images/icons/priorities/medium.svg">Normal</priority>
                        <status id="10002" iconUrl="https://pfspipe.ipmu.jp/jira/images/icons/statuses/generic.png" description="The issue is resolved, reviewed, and merged">Done</status>
                    <statusCategory id="3" key="done" colorName="green"/>
                                    <resolution id="10000">Done</resolution>
                                        <assignee username="sogo.mineo">sogo.mineo</assignee>
                                    <reporter username="takuji.yamashita">Takuji Yamashita</reporter>
                        <labels>
                            <label>flux-calibration</label>
                            <label>model-templates</label>
                    </labels>
                <created>Mon, 25 Oct 2021 07:06:20 +0000</created>
                <updated>Wed, 22 Dec 2021 00:22:58 +0000</updated>
                            <resolved>Wed, 22 Dec 2021 00:22:57 +0000</resolved>
                                                                        <due></due>
                            <votes>0</votes>
                                    <watches>6</watches>
                                                                <comments>
                            <comment id="24788" author="takuji.yamashita" created="Tue, 26 Oct 2021 00:46:31 +0000"  >&lt;p&gt;We lean to saving the 6k templates in fixed-point numbers to reduce the file size. The size could reduce roughly by half, ~1-2GB. We will convert the templates to log and then convert them to fixed-point numbers. We need to test the accuracy.&lt;/p&gt;</comment>
                            <comment id="24789" author="price" created="Tue, 26 Oct 2021 00:48:04 +0000"  >&lt;p&gt;We should also use WCS to do the wavelengths if we can.&lt;/p&gt;</comment>
                            <comment id="24790" author="takuji.yamashita" created="Tue, 26 Oct 2021 00:57:52 +0000"  >&lt;p&gt;The spectra are saved in FITS again. We can use WCS for wavelengths.&#160;&lt;/p&gt;</comment>
                            <comment id="24791" author="sogo.mineo" created="Tue, 26 Oct 2021 01:11:54 +0000"  >&lt;p&gt;The size estimate appearing in the description does not include wavelength column. FITS files contain flux only. Wavelengths are computed by means of WCS.&lt;/p&gt;</comment>
                            <comment id="24804" author="hassan" created="Tue, 26 Oct 2021 15:06:23 +0000"  >&lt;p&gt;Missing important comment from @rhl: &lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;As we may need to do something cleverer someday, I&apos;d hide it behind an API&lt;/p&gt;&lt;/blockquote&gt;

&lt;p&gt;That way, the access and processing software are decoupled from the stored data format.&lt;/p&gt;</comment>
                            <comment id="24806" author="hassan" created="Tue, 26 Oct 2021 15:57:13 +0000"  >&lt;p&gt;Discussed this a little further with &lt;a href=&quot;https://pfspipe.ipmu.jp/jira/secure/ViewProfile.jspa?name=rhl&quot; class=&quot;user-hover&quot; rel=&quot;rhl&quot;&gt;rhl&lt;/a&gt; and &lt;a href=&quot;https://pfspipe.ipmu.jp/jira/secure/ViewProfile.jspa?name=price&quot; class=&quot;user-hover&quot; rel=&quot;price&quot;&gt;price&lt;/a&gt;. Is it possible to store the data on a server somewhere? That would be easier to manage than to store the data under git-lfs. &lt;/p&gt;</comment>
                            <comment id="24813" author="sogo.mineo" created="Tue, 26 Oct 2021 23:52:39 +0000"  >&lt;p&gt;We can indeed put the heavy things in hscdata.mtk.nao.ac.jp, for example. The problem is how to let the test process see them. I would like to hook the first call to &lt;tt&gt;getModelSpectrum()&lt;/tt&gt; and get all spectra downloaded into ... some directory. I don&apos;t want to use &lt;tt&gt;/tmp&lt;/tt&gt; since astropy does this and fills up the limited capacity of &lt;tt&gt;/tmp&lt;/tt&gt; all too soon, then killed and leaving the system unstable. Another solution might be to make valid a path starting with &lt;tt&gt;https://&lt;/tt&gt; and download the models one by one every time they are requested, just as we actually open and read the model files one by one every time they are requested,&lt;/p&gt;</comment>
                            <comment id="24825" author="sogo.mineo" created="Wed, 27 Oct 2021 06:11:15 +0000"  >&lt;p&gt;I would like to take the last route (&quot;Another solution might be to make valid a path...&quot;) because it is the easiest thing to do. If I take this route, it may be that we no longer have to reduce the model size but that we can use all of 60k models, which was Yamashita-san&apos;s first plan. One problem is that the task of making calibration references will take a few hours (even if the models are in a local storage) to process a single fiber, so that it will take at least a few hours for a unit test to be proved.&lt;/p&gt;</comment>
                            <comment id="24829" author="rhl" created="Wed, 27 Oct 2021 15:09:09 +0000"  >&lt;p&gt;I was assuming that the test data would just be a dependency, so it&apos;d be installed once (using as it were &lt;tt&gt;curl&lt;/tt&gt;) and then used whenever you run the tests.&lt;/p&gt;</comment>
                            <comment id="24852" author="sogo.mineo" created="Thu, 28 Oct 2021 05:51:08 +0000"  >&lt;p&gt;Then what I have to do are:&lt;/p&gt;
&lt;ol&gt;
	&lt;li&gt;Upload the 60k (or smaller 6k) spectra to some http server. (The package might need &lt;tt&gt;ups/&lt;/tt&gt; directory)&lt;/li&gt;
	&lt;li&gt;Ask people (I don&apos;t know who) to install it, as a dependency, into the server in which automatic tests are run.&lt;/li&gt;
	&lt;li&gt;Push branches that require the package.&lt;/li&gt;
&lt;/ol&gt;


&lt;p&gt;Do I understand correctly?&lt;/p&gt;</comment>
                            <comment id="24892" author="price" created="Mon, 1 Nov 2021 14:59:26 +0000"  >&lt;p&gt;Yes, that&apos;s great. Please be sure to include a &lt;tt&gt;README&lt;/tt&gt; file that explains what the data are and where they came from, and include a version string (usually the date) in the directory name.&lt;/p&gt;

&lt;p&gt;You&apos;re welcome to put it on the tiger cluster at Princeton (&lt;tt&gt;e.g., /projects/HSC/PFS/fluxCal/fluxCal-20211101&lt;/tt&gt;), and we can serve it via http from there.&lt;/p&gt;</comment>
                            <comment id="24917" author="sogo.mineo" created="Tue, 2 Nov 2021 03:14:04 +0000"  >&lt;p&gt;I tentatively created a package just now, but I found myself not sure whether or not the synthetic spectra are redistributable. I am now checking it.&lt;br/&gt;
The package name will be &lt;tt&gt;fluxmodeldata-ambre-20210512-full&lt;/tt&gt; in which &lt;tt&gt;ambre-20210512-full&lt;/tt&gt; is its version name.&lt;/p&gt;</comment>
                            <comment id="25176" author="sogo.mineo" created="Mon, 22 Nov 2021 02:10:42 +0000"  >&lt;p&gt;Tanaka-san said we have been given permission of redistribution of the synthetic spectra by the author. Yamashita-san found some flaws in converting the original spectra to the format he uses. He is now recreating the data files.&lt;/p&gt;</comment>
                            <comment id="25693" author="sogo.mineo" created="Mon, 6 Dec 2021 08:16:09 +0000"  >&lt;p&gt;I have uploaded the smaller dataset here &lt;a href=&quot;https://hscdata.mtk.nao.ac.jp/hsc_bin_dist/pfs/fluxmodeldata-ambre-20190419-small.tar.xz&quot; class=&quot;external-link&quot; rel=&quot;nofollow&quot;&gt;https://hscdata.mtk.nao.ac.jp/hsc_bin_dist/pfs/fluxmodeldata-ambre-20190419-small.tar.xz&lt;/a&gt;&#160;.&lt;br/&gt;
I intend this package to be used by unit tests because Yamashita-san&apos;s algorithm to make flux reference takes several hours per fiber when the full dataset is used.&lt;/p&gt;

&lt;p&gt;The full dataset has not been completed yet. We found that we had to add more spectra to the dataset, and the spectra have yet to be made. We must also examine whether the tremendous amount of the full dataset and the eon-long execution time really contribute to accuracy of the calibration task.&lt;/p&gt;</comment>
                            <comment id="25695" author="sogo.mineo" created="Tue, 7 Dec 2021 02:10:28 +0000"  >&lt;p&gt;If the above dataset is approved and installed in the server where the tests run, I would like to push changes to &lt;tt&gt;drp_stella&lt;/tt&gt; and &lt;tt&gt;drp_pfs_data&lt;/tt&gt; that are named this issue. With the changes, the broadband photometry table referred to by &lt;tt&gt;FitBroadbandSEDTask&lt;/tt&gt; is moved from &lt;tt&gt;drp_pfs_data&lt;/tt&gt; to the above package. The broadband photometry table must reside close to the spectrum set because the two must match with each other.&lt;/p&gt;</comment>
                            <comment id="25729" author="takuji.yamashita" created="Tue, 14 Dec 2021 08:07:34 +0000"  >&lt;p&gt;We can close this ticket because we have discussed this issue and Mineo-san has loaded the model template dataset. I will file two new tickets for the works Mineo-san said above:&lt;br/&gt;
1. check and approve the dataset that Mineo-san has uploaded, and then install it on the server.&lt;br/&gt;
2. move the broad-band photometry table to the dataset package&lt;br/&gt;
May I assign the 1st one to &lt;a href=&quot;https://pfspipe.ipmu.jp/jira/secure/ViewProfile.jspa?name=price&quot; class=&quot;user-hover&quot; rel=&quot;price&quot;&gt;price&lt;/a&gt; and the 2nd one to &lt;a href=&quot;https://pfspipe.ipmu.jp/jira/secure/ViewProfile.jspa?name=sogo.mineo&quot; class=&quot;user-hover&quot; rel=&quot;sogo.mineo&quot;&gt;sogo.mineo&lt;/a&gt;&#160;?&lt;/p&gt;</comment>
                            <comment id="30208" author="price" created="Fri, 17 Dec 2021 16:11:24 +0000"  >&lt;p&gt;I&apos;ve retrieved the tarball listed above, and placed it in &lt;tt&gt;/projects/HSC/PFS/fluxCal&lt;/tt&gt; on our Tiger cluster. It looks good to me.&lt;/p&gt;</comment>
                            <comment id="30215" author="sogo.mineo" created="Mon, 20 Dec 2021 07:03:10 +0000"  >&lt;p&gt;I have made two PR, one of which is to make &lt;tt&gt;drp_stella&lt;/tt&gt; depend on &lt;tt&gt;fluxmodeldata&lt;/tt&gt; package. The other one is to remove the photometry table from &lt;tt&gt;drp_pfs_data&lt;/tt&gt;. The former change should be made before the latter change. Could you review these things?&lt;/p&gt;</comment>
                            <comment id="30216" author="price" created="Mon, 20 Dec 2021 16:28:26 +0000"  >&lt;p&gt;I don&apos;t think we can require that every installation of the pipeline contains a 2.4 GB data package. You should make the data package &lt;tt&gt;setupOptional&lt;/tt&gt; in the table file, and protect the tests with checks, e.g., &lt;a href=&quot;https://github.com/lsst/afw/blob/main/tests/test_maskedImageIO.py#L100&quot; class=&quot;external-link&quot; rel=&quot;nofollow&quot;&gt;here&lt;/a&gt;.&lt;/p&gt;</comment>
                            <comment id="30230" author="sogo.mineo" created="Tue, 21 Dec 2021 06:23:27 +0000"  >&lt;p&gt;I changed &lt;tt&gt;fluxmodeldata&lt;/tt&gt; from required to optional. Could you review the newly pushed patch? &lt;/p&gt;</comment>
                            <comment id="30236" author="price" created="Tue, 21 Dec 2021 19:25:59 +0000"  >&lt;p&gt;Awesome, thanks!&lt;/p&gt;</comment>
                            <comment id="30240" author="sogo.mineo" created="Wed, 22 Dec 2021 00:22:58 +0000"  >&lt;p&gt;Thanks for the review. I merged my two pull requests to master.&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10000">
                    <name>Blocks</name>
                                            <outwardlinks description="blocks">
                                        <issuelink>
            <issuekey id="18291">PIPE2D-921</issuekey>
        </issuelink>
                            </outwardlinks>
                                                        </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                            <customfield id="customfield_10500" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10006" key="com.pyxis.greenhopper.jira:gh-epic-link">
                        <customfieldname>Epic Link</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>PIPE2D-303</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                <customfield id="customfield_10010" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>0|zzsglb:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10100" key="com.atlassian.jira.plugin.system.customfieldtypes:userpicker">
                        <customfieldname>Reviewers</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>price</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                </customfields>
    </item>
</channel>
</rss>