Ticket #388 (closed task: fixed)

Opened 5 years ago

Last modified 5 years ago

elements we need to see in an article xml feed

Reported by: russ Assigned to: russ
Priority: high Milestone:
Component: topaz Version:
Keywords: xml feed atom Cc:
Blocking: Blocked By:

Description

some of which may not fit in an atom feed.

looking at the existing RSS tool, it looks like the only thing we're missing from the RSS feed that exists in the generic XML feed are the subject categories.

we are currently supplying google scholar with subject categories, but it's not required so we could live without this for a bit.

if there's a way to handle subject categories in an atom feed that would be ideal.

we do need a way to grab a generic xml feed for articles and their metadata (and the current RSS tool does provide this), so if this is impossible for 0.7, please put it on the schedule for the next release.

current RSS output example:

[plos@plosweb01 ~]$ /usr/local/topaz/bin/rss -baseURL http://plostopaz01.localdomain -rss
...
   <channel>
      <title>PLoS ONE Alerts</title>
      <link>http://www.plosone.org</link>
      <image>
         <url>http://www.plosone.org/images/pone_favicon.ico</url>
         <title>PLoS ONE Alerts</title>
         <link>http://www.plosone.org</link>
      </image>
      <description>PLoS ONE Journal</description>
      <item>
         <title>Mutants in the Mouse NuRD/Mi2 Component P66α Are Embryonic Lethal</title>
         <pubDate>Wed, 13 Jun 2007 00:00:00 GMT</pubDate>
         <link>http://www.plosone.org/article/fetchArticle.action?articleURI=info:doi/10.1371/journal.pone.0000519</link>
         <description>BackgroundThe NuRD/Mi2 chromatin complex is involved in histone modifications and contains a large number of subunits, including the p66 protein. There are two mouse and human p66 paralogs, p66α and p66β.
The functions of these genes are not clear, in part because there are no mutants available, except in invertebrate model systems.MethodologyWe made loss of function mutants in the mouse p66α gene (mp66α, official name Gatad2a,
MGI:2384585). We found that mp66α is essential for development, as mutant embryos die around day 10 of embryogenesis. The gene is not required for normal blastocyst development or for implantation. The phenotype of mutant embry
os and the pattern of gene expression in mutants are consistent with a role of mp66α in gene silencing.Conclusionmp66α is an essential gene, required for early mouse development. The lethal phenotype supports a role in executio
n of methylated DNA silencing.</description>
         <author>Susan Marino, Roel Nusse</author>
      </item>

generic XML example:

[plos@plosweb01 ~]$ /usr/local/topaz/bin/rss -baseURL http://plostopaz01.localdomain
<articles>
  <article>
    <uri>info:doi/10.1371/journal.pone.0000519</uri>
    <title>Mutants in the Mouse NuRD/Mi2 Component P66α Are Embryonic Lethal</title>
    <description><sec><title>Background</title><p>The NuRD/Mi2 chromatin complex is involved in histone modifications and contains a large number of subunits, including the p66 protein. There are two mouse and human p66 paralog
s, p66α and p66β. The functions of these genes are not clear, in part because there are no mutants available, except in invertebrate model systems.</p></sec><sec><title>Methodology</title><p>We made loss of function mutants in
the mouse p66α gene (mp66α, official name Gatad2a, MGI:2384585). We found that mp66α is essential for development, as mutant embryos die around day 10 of embryogenesis. The gene is not required for normal blastocyst development
 or for implantation. The phenotype of mutant embryos and the pattern of gene expression in mutants are consistent with a role of mp66α in gene silencing.</p></sec><sec><title>Conclusion</title><p>mp66α is an essential gene, re
quired for early mouse development. The lethal phenotype supports a role in execution of methylated DNA silencing.</p></sec></description>
    <date>2007-06-13</date>
    <authors>
      <author>Susan Marino</author>
      <author>Roel Nusse</author>
    </authors>
    <categories>
      <category>Developmental Biology</category>
      <category>Genetics and Genomics</category>
    </categories>
    <subjects>
      <subject>Developmental Biology</subject>
      <subject>Developmental Biology/Embryology</subject>
      <subject>Genetics and Genomics/Epigenetics</subject>
    </subjects>
  </article>

}}}

Dependency Graph

Attachments

Feed.atom (78.6 kB) - added by amit on 06/14/07 17:29:46.
Atom feed returned by executing the query ' wget http://dobby.topazproject.org:8080/article/feed?maxResults=20'

Change History

06/14/07 15:51:31 changed by russ

once we get the next batch of rpms, i'll test the atom feeds to make sure that they have everything we need for alerts and google scholar, and i'll update this ticket with anything that's missing.

06/14/07 16:07:04 changed by amit

  • owner changed from somebody to jsuttor.

Thanks Russ.

Jeff, I don't think this should be an issue, but will let you confirm if we are returning categories with the ATOM feed.

06/14/07 16:15:35 changed by amit

I just found this ATOM Category so suspect this will not be an issue. I will let Jeff confirm if we are extracting that information.

06/14/07 16:39:43 changed by amit

XML Elements ATOM Elements
articles Not needed (no subnesting)
article <atom:entry>
uri <atom:link>
title <atom:title>
description <atom:summary>
date <atom:modified>, <atom:issued>, <atom:created>
authors Not needed (no subnesting)
author <atom:author>
categories Not needed (no subnesting)
category <atom:category>
subjects Not needed (no subnesting)
subject I think this can be done with <atom:category> with 'scheme'

06/14/07 17:28:54 changed by amit

I think it will work without modifications. Here is the query I did:

wget http://dobby.topazproject.org:8080/article/feed?maxResults=20

And am attaching the resulting file. Note: I am not sure about the precise field matches but am asusming that Rhone is doing the right thing (as Content seems suspect to me).

06/14/07 17:29:46 changed by amit

  • attachment Feed.atom added.

Atom feed returned by executing the query ' wget http://dobby.topazproject.org:8080/article/feed?maxResults=20'

06/14/07 17:31:39 changed by amit

  • owner changed from jsuttor to russ.

Russ, you will have to modify your XSLT, but I think you have all the information you need. Please take a look and let us know.

06/18/07 17:04:03 changed by russ

  • status changed from new to closed.
  • resolution set to fixed.

we've got what we need. i've opened a new ticket suggesting future refinements to the category XML.

10/29/07 21:13:36 changed by

  • milestone deleted.

Milestone 0.7 deleted