# Checklist for XML Projects
# XML CAR Checklist
If you work on CAR XML projects, the XML file that comes out of the conversion tool usually contains many errors that have to be fixed manually.
In this post I share the step-by-step checklist to follow so that the XML file passes QA.
1️⃣ Check File Type (AIP or FINAL)
First, check the file type.
AIP File: aip-xxxxx
The XML should contain:
<unit type="ARTICLE-IN-PRESS">
<status state="new" stage="S200"/>
FINAL File: final-xxxxx
<unit type="BATCH">
<status state="new" stage="S300"/>
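The file-type rule above can be sketched as a small Python helper. This is only a sketch, assuming the type can be read from the `aip-` / `final-` file-name prefix shown above; the function name `expected_unit_and_stage` is made up for illustration.

```python
def expected_unit_and_stage(file_name: str) -> tuple[str, str]:
    """Return the (unit type, stage) pair expected for a given file name."""
    name = file_name.strip().lower()
    if name.startswith("aip-"):
        # AIP files: Article in Press, stage S200
        return ("ARTICLE-IN-PRESS", "S200")
    if name.startswith("final-"):
        # FINAL files: batch, stage S300
        return ("BATCH", "S300")
    raise ValueError(f"Cannot tell AIP from FINAL for file name: {file_name!r}")
```

Compare the returned pair with the `type` attribute of `<unit>` and the `stage` attribute of `<status>` in your XML.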
2️⃣ Check Citation Type
Check the article-type in the source XML, and also check the IPUI and Tpa Id.
<citation-type code="ar"/>
Use this table to map the source article-type to the PIT code:
| Type | Article-type from Source | PIT Code |
|---|---|---|
| Article | research-article | ar |
| | review-article | re |
| | case-report | ar |
| | brief-report | ar |
| | announcement | ar |
| Editorial | editorial | ed |
| | Introduction | ed |
| Letter | letter | le |
| | Letter to the Editor | le |
| | Letters to the Editor | le |
| Correction | correction | er |
| | corrected-article | er |
| | CORRECTIONS | er |
| | Erratum | er |
| | Expression of Concern | er |
| Other | abstract | cb |
| | Consensus Conference | cb |
| | calendar | cal |
| | news | nws |
| | other | mis |
| | retraction | ret |
| Discarded | List of contributors, Front Matter, Copyright, Obituary, Book review, Editorial Board | - |
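If you prefer to look the code up in a script, the table can be written as a plain dictionary. This is a minimal sketch: the names `PIT_CODES`, `DISCARDED` and `pit_code` are hypothetical, and matching is done case-insensitively, which is an assumption on my side.

```python
# Source article-type (lower-cased) -> PIT code, copied from the table above.
PIT_CODES = {
    "research-article": "ar", "review-article": "re", "case-report": "ar",
    "brief-report": "ar", "announcement": "ar",
    "editorial": "ed", "introduction": "ed",
    "letter": "le", "letter to the editor": "le", "letters to the editor": "le",
    "correction": "er", "corrected-article": "er", "corrections": "er",
    "erratum": "er", "expression of concern": "er",
    "abstract": "cb", "consensus conference": "cb", "calendar": "cal",
    "news": "nws", "other": "mis", "retraction": "ret",
}

# Types that are discarded and therefore have no PIT code.
DISCARDED = {
    "list of contributors", "front matter", "copyright",
    "obituary", "book review", "editorial board",
}

def pit_code(article_type):
    """Return the PIT code, or None when the type is discarded."""
    key = article_type.strip().lower()
    if key in DISCARDED:
        return None
    return PIT_CODES[key]  # raises KeyError for a type not in the table
```

A `KeyError` means the article-type is not in the table and needs a manual decision.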
3️⃣ Check DOI
The DOI must match the DOI in the source XML.
Example:
<ce:doi>10.1097/ASW.0000000000000335</ce:doi>
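A quick way to compare the two values is a small text check like the one below. It is a sketch only: `doi_matches` is a hypothetical helper, it reads the CAR XML as plain text with a regular expression, and it expects you to pass in the DOI you copied from the source XML.

```python
import re

# Matches the text inside <ce:doi>...</ce:doi>, ignoring surrounding whitespace.
CE_DOI = re.compile(r"<ce:doi>\s*(.*?)\s*</ce:doi>", re.DOTALL)

def doi_matches(source_doi, car_xml_text):
    """True when the first <ce:doi> in the CAR XML equals the source DOI."""
    found = CE_DOI.search(car_xml_text)
    return found is not None and found.group(1) == source_doi.strip()
```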
4️⃣ Fix Author Group and Sequence
The author sequence must be continuous.
Example:
author-group
├── author
│   ├── initials
│   ├── surname
│   └── given-name
│
└── affiliation
    ├── organization
    ├── city
    ├── state
    └── country
<!-- Author Group Start -->
<author-group seq="1">
<!-- Author 1 Start -->
<author seq="1">
<ce:initials>Y.</ce:initials>
<ce:surname>Fukuta</ce:surname>
<ce:given-name>Yuriko</ce:given-name>
<ce:e-address>yuriko.fukuta@bcm.edu</ce:e-address>
</author>
<!-- Author 1 End -->
<!-- Affiliation for Author Group -->
<affiliation>
<organization>Department of Medicine</organization>
<organization>Section of Infectious Diseases</organization>
<organization>Baylor College of Medicine</organization>
<city>Houston</city>
<state>Texas</state>
<country iso-code="USA"/>
<ce:source-text>
Department of Medicine, Section of Infectious Diseases,
Baylor College of Medicine, Houston, Texas, USA
</ce:source-text>
</affiliation>
<!-- Affiliation End -->
</author-group>
<!-- Author Group End -->
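The "continuous sequence" check can be automated with Python's standard `xml.etree.ElementTree`. This is a rough sketch: `author_seqs_are_continuous` is a hypothetical name, and because this post does not say whether numbering restarts in each author-group, the function simply checks whatever element you pass in. The sample in the usage note leaves out the `ce:` children, since ElementTree needs the namespace declared before it can parse prefixed tags.

```python
import xml.etree.ElementTree as ET

def author_seqs_are_continuous(element):
    """True when the <author> seq values under `element` run 1, 2, 3, ..."""
    seqs = [author.get("seq") for author in element.iter("author")]
    expected = [str(n) for n in range(1, len(seqs) + 1)]
    return seqs == expected
```

For example, `author_seqs_are_continuous(ET.fromstring('<author-group seq="1"><author seq="1"/><author seq="3"/></author-group>'))` reports a gap because seq 2 is missing.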
5️⃣ Fix Initials
Initials are generated from the given name:
Examples:
MM → M.M.
R-D → R.-D.
FAA → F.A.A.
Every initial must end with a dot.
Incorrect:
<ce:initials>A</ce:initials>
Correct:
<ce:initials>A.</ce:initials>
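The dot rule is easy to apply with a few lines of Python. This is a minimal sketch under one assumption: the input is already the compact initials string (such as `MM` or `R-D`), not the full given name. The name `format_initials` is hypothetical.

```python
def format_initials(raw):
    """Add a dot after every letter and keep hyphens, e.g. 'R-D' -> 'R.-D.'."""
    parts = []
    for ch in raw:
        if ch.isalpha():
            parts.append(ch + ".")
        elif ch == "-":
            parts.append("-")
        # existing dots and spaces are dropped so the result is not doubled
    return "".join(parts)
```

Running it on a value that already has dots leaves it unchanged, so it is safe to apply to every `<ce:initials>` value.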
6️⃣ Fix Affiliations
For USA / Canada / Australia:
- `<organization>` × N: all organization/department names
- `<address-part>`: any leftover info that is not city/state
- `<city>`: explicit city
- `<state>`: if present
- `<postal-code>`: if present
- `<country iso-code="..."/>`: always
For Rest of World:
- `<organization>` × N: all organization/department names
- `<address-part>`: city, region, everything except postal code
- `<postal-code>`: if present
- `<country iso-code="..."/>`: always
General rules:
- `<pt>`: remove everywhere, it is a problem tag
- `<institution content-type="...">`: remove and replace with `<organization>`
- `<ce:source-text>`: keep as-is, no changes
- Correspondence affiliation: same structure, no `<ce:source-text>`
- Postal codes: never inside `<state>` or `<address-part>`, always in the `<postal-code>` tag
Example for USA:
<affiliation>
<organization>Department of Medicine</organization>
<organization>Section of Infectious Diseases</organization>
<organization>Baylor College of Medicine</organization>
<city>Houston</city>
<state>Texas</state>
<country iso-code="USA"/>
<ce:source-text>Department of Medicine, Section of Infectious Diseases, Baylor College of Medicine, Houston, Texas, USA</ce:source-text>
</affiliation>
For other countries:
<!-- Affiliation Start -->
<affiliation>
<organization>Department of Oncology</organization>
<organization>University of Copenhagen</organization>
<address-part>Copenhagen</address-part>
<country iso-code="DNK"/>
<ce:source-text>Department of Oncology, University of Copenhagen, Copenhagen, Denmark</ce:source-text>
</affiliation>
<!-- Affiliation End -->
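The two layouts can be sketched as one builder function. Treat this as an illustration only: `build_affiliation` and `CITY_STATE_COUNTRIES` are hypothetical names, the child order follows the lists in this step, and `<ce:source-text>` is left out because it must be copied unchanged from the file anyway.

```python
import xml.etree.ElementTree as ET

# Countries that use explicit <city>/<state> tags, per the rules in this step.
CITY_STATE_COUNTRIES = {"USA", "CAN", "AUS"}

def build_affiliation(organizations, country_iso, city=None, state=None,
                      address_part=None, postal_code=None):
    """Build an <affiliation> element with the layout for the given country."""
    aff = ET.Element("affiliation")
    for name in organizations:
        ET.SubElement(aff, "organization").text = name
    if country_iso in CITY_STATE_COUNTRIES:
        fields = [("address-part", address_part), ("city", city), ("state", state)]
    else:
        # Rest of world: city and region are folded into <address-part>.
        merged = ", ".join(p for p in (address_part, city, state) if p)
        fields = [("address-part", merged or None)]
    fields.append(("postal-code", postal_code))  # never inside state/address-part
    for tag, value in fields:
        if value:  # skip missing values so no empty tag is written
            ET.SubElement(aff, tag).text = value
    ET.SubElement(aff, "country", {"iso-code": country_iso})
    return aff
```

`ET.tostring(aff, encoding="unicode")` gives the XML text if you want to paste it into the file.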
7️⃣ Fix Copyright
Take the copyright statement from the input XML and paste it into the XML code:
<publishercopyright>
© 2025 American Society of Clinical Oncology
</publishercopyright>
8️⃣ Remove Bold and Italic Tags
Remove the formatting tags that come from the PDF.
`<sup>` (superscript): leave unchanged.
`<sub>` (subscript): replace with `<inf>`.
Tags to remove:
<bold>
<underline>
<italic>
<ext-link>
Correct structure:
<ce:para>
Results of the study...
</ce:para>
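The tag cleanup can be done with two regular expressions. This is a sketch that works on the XML as plain text, assuming the tags carry no content you need from their attributes; `clean_inline_markup` and the constant names are hypothetical.

```python
import re

# Tags that are removed while their inner text is kept.
STRIP_TAGS = ("bold", "italic", "underline", "ext-link")
STRIP_PATTERN = re.compile(
    r"</?(?:" + "|".join(re.escape(t) for t in STRIP_TAGS) + r")\b[^>]*>"
)
# <sub> and </sub> are renamed to <inf> and </inf>; <sup> is not touched.
SUB_PATTERN = re.compile(r"<(/?)sub\b([^>]*)>")

def clean_inline_markup(text):
    """Strip formatting tags and rename <sub> to <inf>."""
    text = STRIP_PATTERN.sub("", text)
    return SUB_PATTERN.sub(r"<\1inf\2>", text)
```

In the reference section `<ext-link>` needs a conversion instead of a plain removal, so run this only on paragraph text.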
9️⃣ Fix Volume and Issue Number
Check the values against order.xml.
Example:
<volume-issue-number>
<vol-first>38</vol-first>
<iss-first>8</iss-first>
</volume-issue-number>
If a tag is empty, first check order.xml.
If the value is empty in order.xml too, remove the tag.
Empty tags are not allowed in CAR XML.
Example
❌ Incorrect
<volume-issue-number>
<vol-first>38</vol-first>
<iss-first></iss-first>
</volume-issue-number>
✔ Correct
<volume-issue-number>
<vol-first>38</vol-first>
</volume-issue-number>
2️⃣ Fix <vol-first> Text Value
Sometimes `<vol-first>` contains a text value instead of a number.
Example
<vol-first>Ahead of Print</vol-first>
This has to be fixed.
Rule:
vol-first → vol-text
✔ Correct Format
<vol-text>Ahead of Print</vol-text>
Example
❌ Incorrect
<vol-first>Ahead of this</vol-first>
✔ Correct
<vol-text>Ahead of this</vol-text>
3️⃣ Fix <iss-first> with Supplement Value
Sometimes `<iss-first>` contains a supplement value.
Example
<iss-first>16_suppl</iss-first>
This has to be converted to the supplement tag.
❌ Incorrect
<volume-issue-number>
<vol-first>43</vol-first>
<iss-first>16_suppl</iss-first>
</volume-issue-number>
✔ Correct
<volume-issue-number>
<vol-first>43</vol-first>
<suppl>16_suppl</suppl>
</volume-issue-number>
Quick Rule Summary
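| Situation | Action |
|---|---|
| `<iss-first>` is empty (and empty in order.xml too) | Remove the tag |
| `<vol-first>` contains text | Convert to `<vol-text>` |
| `<iss-first>` contains a supplement value | Move value to `<suppl>` |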
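The three volume/issue rules can be combined in one pass with `xml.etree.ElementTree`. This is a hedged sketch: `normalize_volume_issue` is a hypothetical name, "text value" is taken to mean anything that is not all digits, and "supplement value" means the text contains `suppl`. It does not look at order.xml, so do that check by hand before removing anything.

```python
import xml.etree.ElementTree as ET

def normalize_volume_issue(elem):
    """Apply the volume/issue rules to a <volume-issue-number> element in place."""
    for child in list(elem):  # copy the list because children may be removed
        value = (child.text or "").strip()
        if not value:
            elem.remove(child)            # empty tags are not allowed
        elif child.tag == "vol-first" and not value.isdigit():
            child.tag = "vol-text"        # text value: vol-first -> vol-text
        elif child.tag == "iss-first" and "suppl" in value.lower():
            child.tag = "suppl"           # supplement value: iss-first -> suppl
    return elem
```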
🔟 Fix Page Information
Case 1
If only the first page is present (for example E69), convert it to an article number and place it below the </volispage> tag.
<article-number>E69</article-number>
Case 2
If both the first page and the last page are present, leave them unchanged.
<first-page>E69</first-page>
<last-page>E72</last-page>
1️⃣1️⃣ Fix Publication Date
AIP file
<publicationyear first="2025"/>
FINAL file
<publicationyear first="2025"/>
<publicationdate>
<year>2025</year>
<month>09</month>
<day>01</day>
<date-text>September 2025</date-text>
</publicationdate>
Rule
Day = 01 → date-text allowed
1️⃣2️⃣ Fix References
Most errors occur in the reference section.
Fix initials
Incorrect
Mitja O
Correct
<ce:initials>O.</ce:initials>
Fix website link:
Note: after fixing this error, place the result in the reference just before the closing </ref-info> tag.
Fix <ext-link> in Reference Part
Sometimes the reference section contains this structure:
<pt>
<ext-link>https://www.cdc.gov/mpox/hcp/clinical-care/index.html</ext-link>
</pt>
This is not correct in CAR XML format and has to be fixed.
Rule
<pt><ext-link> → convert to ref-website structure
✔ Correct
<ref-website>
<ce:e-address type="url">
https://www.cdc.gov/mpox/hcp/clinical-care/index.html
</ce:e-address>
</ref-website>
Quick Rule Summary
| Situation | Fix |
|---|---|
| `<pt><ext-link>` in reference | Convert to `<ref-website>` |
| Website link in reference | Use `<ce:e-address type="url">` |
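The `<pt><ext-link>` conversion is a simple text substitution. The sketch below only rewrites the structure; it does not move the result in front of `</ref-info>`, so that step stays manual. `to_ref_website` and `PT_EXT_LINK` are hypothetical names.

```python
import re

# <pt> wrapping a single <ext-link>; the URL text is captured without whitespace.
PT_EXT_LINK = re.compile(
    r"<pt>\s*<ext-link[^>]*>\s*(.*?)\s*</ext-link>\s*</pt>", re.DOTALL
)

def to_ref_website(xml_text):
    """Rewrite <pt><ext-link>URL</ext-link></pt> as a ref-website structure."""
    return PT_EXT_LINK.sub(
        r'<ref-website><ce:e-address type="url">\1</ce:e-address></ref-website>',
        xml_text,
    )
```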
1️⃣3️⃣ Fix Grant Coding
Create grant coding only if funding information is present (usually in the acknowledgement or a footnote).
Grant Coding Rules
Grant coding is created only for FINAL files
Do not create grant coding for AIP files
<grant-agency-id> must be taken from the FundRef.rdf file
Look for these keywords in the footnote: support / grant / funding / received. If any of them is present, you will most probably need to create a grant list.
You can follow this template:
<grantlist>
<grant>
<grant-id>XXXX</grant-id>
<grant-acronym>ABC</grant-acronym>
<grant-agency>
<organization>Funding Organization Name</organization>
<country iso-code="XXX"/>
</grant-agency>
<grant-agency-id>XXXX</grant-agency-id>
</grant>
<grant-text>Funding statement from article.</grant-text>
</grantlist>
This prompt is used to generate grant information. Please follow these rules when creating any grant.
You are an expert in Elsevier CAR XML grant information extraction.
Your task is to extract all grant and funding support information
from a given article and structure it according to Elsevier CAR XML rules.
=== STEP 1: LOCATE SUPPORT INFORMATION ===
Scan the ENTIRE article for funding/support information in:
- Beginning and end of article
- Footnotes
- Acknowledgement sections
- Funding or financial support sections
Look for these keywords:
- "funded by"
- "supported by"
- "grant"
- "grant number"
- "grant no."
- "financial support"
- "research supported by"
- "thank"
- Any similar funding/support expressions
=== STEP 2: EXTRACT GRANT DETAILS ===
For each grant found, extract the following
if explicitly present in the article:
1. grant-id
- Grant number, project number, or award ID
- ONLY capture if explicitly stated in article
- NEVER fabricate or assume
- NIFA, project numbers, award IDs qualify
- If NOT present, OMIT <grant-id> entirely
2. grant-acronym
- Funding agency acronym
- ONLY capture if present in FundRef Vocabulary
- If NOT present, OMIT <grant-acronym> entirely
3. organization
- Full standardized agency name
- MUST match FundRef Vocabulary standardized name
- When only acronym in article, use FundRef to
find full name
4. country iso-code
- Three-letter uppercase ISO country code
- e.g., USA, BEL, GBR, DEU, FRA
- MUST match FundRef Vocabulary country
5. grant-agency-id
- Elsevier fundingBodyId from FundRef Vocabulary
- Found in fundingBodyId column of FundRef
=== STEP 3: CORE RULES ===
GRANT ELEMENT RULES:
- ALWAYS create <grant> for EVERY funding agency found
- Create SEPARATE <grant> for each grant award
- If same agency has multiple grants, create separate
<grant> element for each grant
- <grant-id> is OPTIONAL, add ONLY if in article
- NEVER skip a <grant> element
- NEVER fabricate any information
GRANT TEXT RULES:
- Capture ALL funding/support text verbatim
- Use ONLY ONE <grant-text> element per <grantlist>
- Do NOT capture section or paragraph headings
- Include both monetary AND non-monetary support
- Include ALL acknowledgement text related to funding
- If multiple support statements exist, combine them
in order they appear, separated by ". " (full stop
and space)
GRANT AGENCY RULES:
- ALWAYS use FundRef Vocabulary for:
* Standardized agency names
* Agency acronyms
* fundingBodyId (grant-agency-id)
* Country codes
- Do NOT use FundRef for Elsevier published content
- Only capture agency names found in FundRef Vocabulary
GRANT ID RULES:
- Capture ONLY when explicitly present in article
- NEVER fabricate or assume grant numbers
- If no grant number mentioned, OMIT <grant-id>
- Examples of valid grant-id: NIFA, 654836,
FA9550-XX-X-XXXX, project numbers, award IDs
GENERAL RULES:
- Capture ONLY from published article text
- Do NOT follow external website links
- Do NOT assume or invent any information
- Do NOT capture Articles in Press (stage="S200")
- Grant information is NOT captured for
Articles in Press
=== STEP 4: OUTPUT FORMAT ===
<grantlist>
<grant>
<grant-id>[if explicitly present]</grant-id>
<grant-acronym>[if in FundRef]</grant-acronym>
<grant-agency>
<organization>[FundRef standardized name]</organization>
<country iso-code="[3-LETTER UPPERCASE ISO]"/>
</grant-agency>
<grant-agency-id>[FundRef fundingBodyId]</grant-agency-id>
</grant>
[repeat separate <grant> for each grant award]
<grant-text>[verbatim funding text from article]</grant-text>
</grantlist>
FORMATTING RULES:
- All tags LEFT-ALIGNED, no indentation
- Single <grantlist> wrapper
- <grant-text> always LAST inside <grantlist>
- Omit optional elements if not present
- Do not add empty elements
=== STEP 5: VALIDATION CHECKLIST ===
Before delivering final output verify:
[ ] All funding agencies from article captured
[ ] Separate <grant> created for each grant award
[ ] Agency names match FundRef Vocabulary
[ ] Country codes are 3-letter uppercase ISO
[ ] grant-agency-id matches FundRef fundingBodyId
[ ] <grant-id> only present if explicitly in article
[ ] <grant-id> omitted if not mentioned in article
[ ] Only ONE <grant-text> element
[ ] <grant-text> is verbatim from article
[ ] No section headings in <grant-text>
[ ] Tags are left-aligned
[ ] No fabricated information
[ ] No empty elements
[ ] Only published article content captured
[ ] External links not followed
Quick Reference Card:
| Element | Rule |
|---|---|
| `<grant>` | Always create for every agency |
| `<grant-id>` | Only if explicitly in article |
| `<grant-acronym>` | Only if in FundRef Vocabulary |
| `<organization>` | Must match FundRef standardized name |
| `<country iso-code>` | 3-letter uppercase ISO code |
| `<grant-agency-id>` | FundRef fundingBodyId number |
| `<grant-text>` | Verbatim, single element, no headings |
| Multiple grants | Separate `<grant>` for each |
| Multiple texts | Combined with ". " |
| Empty elements | Never add |
| External links | Never follow |
| Fabricated info | Never add |
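If you build the grant list in a script instead of by hand, the rules above could look roughly like this. It is a sketch, not an official tool: `build_grantlist` is a hypothetical function, each grant is passed as a plain dict whose values you have already looked up in FundRef, and nothing in the code validates those values.

```python
import xml.etree.ElementTree as ET

def build_grantlist(grants, grant_text, stage):
    """Build <grantlist>; returns None for Articles in Press (stage S200)."""
    if stage == "S200":
        return None  # grant information is not captured for AIP files
    root = ET.Element("grantlist")
    for info in grants:
        grant = ET.SubElement(root, "grant")
        for tag in ("grant-id", "grant-acronym"):
            if info.get(tag):  # optional: omitted instead of left empty
                ET.SubElement(grant, tag).text = info[tag]
        agency = ET.SubElement(grant, "grant-agency")
        ET.SubElement(agency, "organization").text = info["organization"]
        ET.SubElement(agency, "country", {"iso-code": info["country"]})
        ET.SubElement(grant, "grant-agency-id").text = info["grant-agency-id"]
    # Exactly one <grant-text>, always last inside <grantlist>.
    ET.SubElement(root, "grant-text").text = grant_text
    return root
```

Call it once per article with one dict per grant award; pass the verbatim funding statement as `grant_text`.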
All country codes are listed here:
Country | 3-Letter Code | 2-Letter Code |
|---|---|---|
Afghanistan | AFG | AF |
Albania | ALB | AL |
Algeria | DZA | DZ |
American Samoa | ASM | AS |
Andorra | AND | AD |
Angola | AGO | AO |
Anguilla | AIA | AI |
Antarctica | ATA | AQ |
Antigua and Barbuda | ATG | AG |
Argentina | ARG | AR |
Armenia | ARM | AM |
Aruba | ABW | AW |
Australia | AUS | AU |
Austria | AUT | AT |
Azerbaijan | AZE | AZ |
Bahamas | BHS | BS |
Bahrain | BHR | BH |
Bangladesh | BGD | BD |
Barbados | BRB | BB |
Belarus | BLR | BY |
Belgium | BEL | BE |
Belize | BLZ | BZ |
Benin | BEN | BJ |
Bermuda | BMU | BM |
Bhutan | BTN | BT |
Bolivia | BOL | BO |
Bonaire | BES | BQ |
Bosnia and Herzegovina | BIH | BA |
Botswana | BWA | BW |
Bouvet Island | BVT | BV |
Brazil | BRA | BR |
British Indian Ocean Territory | IOT | IO |
Brunei Darussalam | BRN | BN |
Bulgaria | BGR | BG |
Burkina Faso | BFA | BF |
Burundi | BDI | BI |
Cambodia | KHM | KH |
Cameroon | CMR | CM |
Canada | CAN | CA |
Cape Verde | CPV | CV |
Cayman Islands | CYM | KY |
Central African Republic | CAF | CF |
Chad | TCD | TD |
Chile | CHL | CL |
China | CHN | CN |
Christmas Island | CXR | CX |
Cocos (Keeling) Islands | CCK | CC |
Colombia | COL | CO |
Comoros | COM | KM |
Congo | COG | CG |
Cook Islands | COK | CK |
Costa Rica | CRI | CR |
Cote d'Ivoire | CIV | CI |
Croatia | HRV | HR |
Cuba | CUB | CU |
Curacao | CUW | CW |
Cyprus | CYP | CY |
Czechia | CZE | CZ |
Democratic Republic Congo | COD | CD |
Denmark | DNK | DK |
Djibouti | DJI | DJ |
Dominica | DMA | DM |
Dominican Republic | DOM | DO |
Ecuador | ECU | EC |
Egypt | EGY | EG |
El Salvador | SLV | SV |
Equatorial Guinea | GNQ | GQ |
Eritrea | ERI | ER |
Estonia | EST | EE |
Ethiopia | ETH | ET |
European Union | EUE | EU |
Falkland Islands (Malvinas) | FLK | FK |
Faroe Islands | FRO | FO |
Federated States of Micronesia | FSM | FM |
Fiji | FJI | FJ |
Finland | FIN | FI |
France | FRA | FR |
French Guiana | GUF | GF |
French Polynesia | PYF | PF |
French Southern Territories | ATF | TF |
Gabon | GAB | GA |
Gambia | GMB | GM |
Georgia | GEO | GE |
Germany | DEU | DE |
Ghana | GHA | GH |
Gibraltar | GIB | GI |
Greece | GRC | GR |
Greenland | GRL | GL |
Grenada | GRD | GD |
Guadeloupe | GLP | GP |
Guam | GUM | GU |
Guatemala | GTM | GT |
Guinea | GIN | GN |
Guinea-Bissau | GNB | GW |
Guyana | GUY | GY |
Haiti | HTI | HT |
Heard Island and McDonald Islands | HMD | HM |
Honduras | HND | HN |
Hong Kong | HKG | HK |
Hungary | HUN | HU |
Iceland | ISL | IS |
India | IND | IN |
Indonesia | IDN | ID |
Iran | IRN | IR |
Iraq | IRQ | IQ |
Ireland | IRL | IE |
Israel | ISR | IL |
Italy | ITA | IT |
Jamaica | JAM | JM |
Japan | JPN | JP |
Jordan | JOR | JO |
Kazakhstan | KAZ | KZ |
Kenya | KEN | KE |
Kiribati | KIR | KI |
Kuwait | KWT | KW |
Kyrgyzstan | KGZ | KG |
Laos | LAO | LA |
Latvia | LVA | LV |
Lebanon | LBN | LB |
Lesotho | LSO | LS |
Liberia | LBR | LR |
Libyan Arab Jamahiriya | LBY | LY |
Liechtenstein | LIE | LI |
Lithuania | LTU | LT |
Luxembourg | LUX | LU |
Macao | MAC | MO |
North Macedonia | MKD | MK |
Madagascar | MDG | MG |
Malawi | MWI | MW |
Malaysia | MYS | MY |
Maldives | MDV | MV |
Mali | MLI | ML |
Malta | MLT | MT |
Marshall Islands | MHL | MH |
Martinique | MTQ | MQ |
Mauritania | MRT | MR |
Mauritius | MUS | MU |
Mayotte | MYT | YT |
Mexico | MEX | MX |
Moldova | MDA | MD |
Monaco | MCO | MC |
Mongolia | MNG | MN |
Montenegro | MNE | ME |
Montserrat | MSR | MS |
Morocco | MAR | MA |
Mozambique | MOZ | MZ |
Myanmar | MMR | MM |
Namibia | NAM | NA |
Nauru | NRU | NR |
Nepal | NPL | NP |
Netherlands | NLD | NL |
New Caledonia | NCL | NC |
New Zealand | NZL | NZ |
Nicaragua | NIC | NI |
Niger | NER | NE |
Nigeria | NGA | NG |
Niue | NIU | NU |
Norfolk Island | NFK | NF |
North Korea | PRK | KP |
Northern Mariana Islands | MNP | MP |
Norway | NOR | NO |
Oman | OMN | OM |
Pakistan | PAK | PK |
Palau | PLW | PW |
Palestine | PSE | PS |
Panama | PAN | PA |
Papua New Guinea | PNG | PG |
Paraguay | PRY | PY |
Peru | PER | PE |
Philippines | PHL | PH |
Pitcairn | PCN | PN |
Poland | POL | PL |
Portugal | PRT | PT |
Puerto Rico | PRI | PR |
Qatar | QAT | QA |
Reunion | REU | RE |
Romania | ROU | RO |
Russian Federation | RUS | RU |
Rwanda | RWA | RW |
Saba | BES | BQ |
Saint Eustatius | BES | BQ |
Saint Helena | SHN | SH |
Saint Kitts and Nevis | KNA | KN |
Saint Lucia | LCA | LC |
Saint Martin (Dutch) | SXM | SX |
Saint Martin (French) | MAF | MF |
Saint Pierre and Miquelon | SPM | PM |
Saint Vincent and the Grenadines | VCT | VC |
Samoa | WSM | WS |
San Marino | SMR | SM |
Sao Tome and Principe | STP | ST |
Saudi Arabia | SAU | SA |
Senegal | SEN | SN |
Serbia | SRB | RS |
Seychelles | SYC | SC |
Sierra Leone | SLE | SL |
Singapore | SGP | SG |
Slovakia | SVK | SK |
Slovenia | SVN | SI |
Solomon Islands | SLB | SB |
Somalia | SOM | SO |
South Africa | ZAF | ZA |
South Georgia and the South Sandwich Islands | SGS | GS |
South Korea | KOR | KR |
Spain | ESP | ES |
Sri Lanka | LKA | LK |
South Sudan | SSD | SS |
Sudan | SDN | SD |
Suriname | SUR | SR |
Svalbard and Jan Mayen | SJM | SJ |
Swaziland | SWZ | SZ |
Sweden | SWE | SE |
Switzerland | CHE | CH |
Syrian Arab Republic | SYR | SY |
Taiwan | TWN | TW |
Tajikistan | TJK | TJ |
Tanzania | TZA | TZ |
Thailand | THA | TH |
Timor-Leste | TLS | TL |
Togo | TGO | TG |
Tokelau | TKL | TK |
Tonga | TON | TO |
Trinidad and Tobago | TTO | TT |
Tunisia | TUN | TN |
Turkey | TUR | TR |
Turkmenistan | TKM | TM |
Turks and Caicos Islands | TCA | TC |
Tuvalu | TUV | TV |
Uganda | UGA | UG |
Ukraine | UKR | UA |
United Arab Emirates | ARE | AE |
United Kingdom | GBR | GB |
United States | USA | US |
United States Minor Outlying Islands | UMI | UM |
Uruguay | URY | UY |
Uzbekistan | UZB | UZ |
Vanuatu | VUT | VU |
Vatican City State | VAT | VA |
Venezuela | VEN | VE |
Viet Nam | VNM | VN |
Virgin Islands (British) | VGB | VG |
Virgin Islands (U.S.) | VIR | VI |
Wallis and Futuna | WLF | WF |
Western Sahara | ESH | EH |
Yemen | YEM | YE |
Zambia | ZMB | ZM |
Zimbabwe | ZWE | ZW |
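When the source gives only a 2-letter code, a small lookup saves time. This is a sketch with just a handful of rows copied from the table above; `TWO_TO_THREE` and `to_three_letter` are hypothetical names, and a 3-letter input is passed through without checking that it is in the table.

```python
import re

# A few rows from the table above; extend with the codes you need.
TWO_TO_THREE = {
    "US": "USA", "CA": "CAN", "AU": "AUS", "DK": "DNK",
    "DE": "DEU", "GB": "GBR", "IN": "IND",
}

def to_three_letter(code):
    """Return the 3-letter uppercase code for the iso-code attribute."""
    code = code.strip().upper()
    if re.fullmatch(r"[A-Z]{3}", code):
        return code  # already 3 letters; not validated against the table
    return TWO_TO_THREE[code]  # raises KeyError for an unknown 2-letter code
```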