Created
Mar 6, 2026
Last Modified
1 week ago

#CheckList for Xml Pojects

#XML CAR CHECKLIST

Agar aap CAR XML project par kaam kar rahe ho, to conversion tool ke baad XML file me kaafi errors aate hain jo manually fix karne padte hain.
Is post me main step-by-step checklist share kar raha hu jo XML file ko QA pass karane ke liye follow karni hoti hai.

1️⃣ Check File Type (AIP ya FINAL)

Sabse pehle file ka type check karo.

AIP File: aip-xxxxx

XML me ye hona chahiye:

xml
<unit type="ARTICLE-IN-PRESS">
<status state="new" stage="S200"/>

FINAL File: final-xxxxx

xml
<unit type="BATCH">
<status state="new" stage="S300"/>

2️⃣ Check Citation Type:

Source XML me article-type check karo.

and also check IPUI and Tpa Id.

xml
<citation-type code="ar"/>

you can check here:

Type

Article-type from Source

PIT Code

Article

research-article

ar

review-article

re

case-report

ar

brief-report

ar

announcement

ar

Editorial

editorial

ed

Introduction

ed

Letter

letter

le

Letter to the Editor

le

Letters to the Editor

le

Correction

correction

er

corrected-article

er

CORRECTIONS

er

Erratum

er

Expression of Concern

er

Other

abstract

cb

Consensus Conference

cb

calendar

cal

news

nws

other

mis

retraction

ret

Discarded

List of contributors, Front Matter, Copyright, Obituary, Book review, Editorial Board

3️⃣ Check DOI

DOI source XML se match hona chahiye.

Example:

xml
<ce:doi>10.1097/ASW.0000000000000335</ce:doi>

4️⃣ Fix Author Group and Sequence

Author sequence continuous hona chahiye.

Example:

plaintext
author-group
   ├── author
   │      ├── initials
   │      ├── surname
   │      └── given-name
   │
   └── affiliation
          ├── organization
          ├── city
          ├── state
          └── country
xml
<!-- Author Group Start -->
<author-group seq="1">

    <!-- Author 1 Start -->
    <author seq="1">
        <ce:initials>Y.</ce:initials>
        <ce:surname>Fukuta</ce:surname>
        <ce:given-name>Yuriko</ce:given-name>
        <ce:e-address>yuriko.fukuta@bcm.edu</ce:e-address>
    </author>
    <!-- Author 1 End -->

    <!-- Affiliation for Author Group -->
    <affiliation>
        <organization>Department of Medicine</organization>
        <organization>Section of Infectious Diseases</organization>
        <organization>Baylor College of Medicine</organization>

        <city>Houston</city>
        <state>Texas</state>
        <country iso-code="USA"/>

        <ce:source-text>
            Department of Medicine, Section of Infectious Diseases,
            Baylor College of Medicine, Houston, Texas, USA
        </ce:source-text>
    </affiliation>
    <!-- Affiliation End -->

</author-group>
<!-- Author Group End -->

5️⃣ Fix Initials

intials generate from given name->

Examples:

xml
MM → M.M.
R-D → R.-D.
FAA → F.A.A.

Initials me dot hona zaroori hai.

Incorrect:

xml
<ce:initials>A</ce:initials>

Correct

xml
<ce:initials>A.</ce:initials>

6️⃣ Fix Affiliations

  1. For USA / Canada / Australia:

    • <organization> × N — all org/dept names

    • <address-part> — any leftover info that's not city/state

    • <city> — explicit city

    • <state> — if present

    • <postal-code> — if present

    • <country iso-code="..."/> — always

  2. For Rest of World:

    • <organization> × N — all org/dept names

    • <address-part> — city, region, everything except postal code

    • <postal-code> — if present

    • <country iso-code="..."/> — always

  3. <pt> — remove everywhere, it's a problem tag

  4. <institution content-type="..."> — remove, replace with <organization>

  5. <ce:source-text> — keep as-is, no changes

  6. Correspondence affiliation — same structure, no <ce:source-text>

  7. Postal codes — never inside <state> or <address-part>, always in <postal-code> tag

you can see here:

xml
<affiliation>
<organization>Department of Medicine</organization>
<organization>Section of Infectious Diseases</organization>
<organization>Baylor College of Medicine</organization>
<city>Houston</city>
<state>Texas</state>
<country iso-code="USA"/>
<ce:source-text>Department of Medicine, Section of Infectious Diseases, Baylor College of Medicine, Houston, Texas, USA</ce:source-text>
</affiliation>

For other countries:

xml
<!-- Affiliation Start -->
<affiliation>
<organization>Department of Oncology</organization>
<organization>University of Copenhagen</organization>
<address-part>Copenhagen</address-part>
<country iso-code="DNK"/>
<ce:source-text>Department of Oncology, University of Copenhagen, Copenhagen, Denmark</ce:source-text>
</affiliation>
<!-- Affiliation End -->

Input XML se copyright lena hota hai.

and xml code me paste kar do:

xml
<publishercopyright>
© 2025 American Society of Clinical Oncology
</publishercopyright>

8️⃣ Remove Bold and Italic Tags

PDF se aane wale tags remove karne hote hain.

<sup> superscript not change.

<sub> subscript replace with <inf>.

plaintext
<bold>
<undrline>
<italic>
<ext-link>

Correct structure:

xml
<ce:para>
Results of the study...
</ce:para>

9️⃣ Fix Volume and Issue Number

order.xml se check karo.

Example:

xml
<volume-issue-number>
<vol-first>38</vol-first>
<iss-first>8</iss-first>
</volume-issue-number>

o pehle order.xml me ja kar check karo.

  • Agar order.xml me bhi empty hai, to is tag ko remove kar do.

  • CAR XML me empty tag allowed nahi hota.

Example

❌ Incorrect

xml
<volume-issue-number>
<vol-first>38</vol-first>
<iss-first></iss-first>
</volume-issue-number>

✔ Correct

xml
<volume-issue-number>
<vol-first>38</vol-first>
</volume-issue-number>

2️⃣ Fix <vol-first> Text Value

Kabhi-kabhi <vol-first> me number ki jagah text value milti hai.

Example

plaintext
<vol-first>Ahead of Print</vol-first>

Isko fix karna hota hai.

Rule:

plaintext
vol-first → vol-text

✔ Correct Format

plaintext
<vol-text>Ahead of Print</vol-text>

Example

❌ Incorrect

plaintext
<vol-first>Ahead of this</vol-first>

✔ Correct

plaintext
<vol-text>Ahead of this</vol-text>

3️⃣ Fix <iss-first> with Supplement Value

Kabhi-kabhi <iss-first> me supplement value milti hai.

Example

plaintext
<iss-first>16_suppl</iss-first>

Isko supplement tag me convert karna hota hai.

Incorrect

plaintext
<volume-issue-number>
<vol-first>43</vol-first>
<iss-first>16_suppl</iss-first>
</volume-issue-number>

Correct

plaintext
<volume-issue-number>
<vol-first>43</vol-first>
<suppl>16_suppl</suppl>
</volume-issue-number>

Quick Rule Summary

Situation

Action

<iss-first> empty

Remove the tag

<vol-first> contains text

Convert to <vol-text>

<iss-first> contains *_suppl

Move value to <suppl> tag

🔟 Fix Page Information

Case 1

Agar sirf first page ho

plaintext
E69

to use article number bana do. and isko </volispage> tag ke nichhe rkh do.

xml
<article-number>E69</article-number>

Case 2

Agar first page + last page dono ho

to same rehne do.

xml
<first-page>E69</first-page>
<last-page>E72</last-page>

1️⃣1️⃣ Fix Publication Date

AIP file

xml
<publicationyear first="2025"/>

FINAL file

xml
<publicationyear first="2025"/>

<publicationdate>
<year>2025</year>
<month>09</month>
<day>01</day>
<date-text>September 2025</date-text>
</publicationdate>

Rule

plaintext
Day = 01 → date-text allowed

1️⃣2️⃣ Fix References

Reference part me mostly errors aate hain.

Fix initials

Incorrect

plaintext
Mitja O

Correct

xml
<ce:initials>O.</ce:initials>

Note: after fix this error isko refrence part me jha pr </ref-info> tag close hoga uske just phle rkh dena hai.

Fix <ext-link> in Reference Part

Kabhi-kabhi reference section me ye structure milta hai:

xml
<pt>
<ext-link>https://www.cdc.gov/mpox/hcp/clinical-care/index.html</ext-link>
</pt>

Ye CAR XML format me correct nahi hota, isko fix karna padta hai.

Rule

plaintext
<pt><ext-link> → convert to ref-website structure

✔ Correct

xml
<ref-website>
<ce:e-address type="url">
https://www.cdc.gov/mpox/hcp/clinical-care/index.html
</ce:e-address>
</ref-website>

Quick Rule Summary

Situation

Fix

<pt><ext-link> in reference

Convert to <ref-website>

Website link in reference

Use <ce:e-address type="url">

1️⃣3. Fix Grant Coding

Create grant coding only if funding information is present (usually in acknowledgement or footnote)

Grant Coding Rules

  • Grant coding is created only for FINAL files

  • Do not create grant coding for AIP files

  • <grant-agency-id> must be taken from FundRef.rdf file

find these keywords: support / grant / funding / received in foot-note if present most probably create grant list:

you can follow this template:

xml
<grantlist>
<grant>
<grant-id>XXXX</grant-id>
<grant-acronym>ABC</grant-acronym>
<grant-agency>
<organization>Funding Organization Name</organization>
<country iso-code="XXX"/>
</grant-agency>
<grant-agency-id>XXXX</grant-agency-id>
</grant>
<grant-text>Funding statement from article.</grant-text>
</grantlist>

This prompt is used to generate grant information. Please follow these rules when creating any grant.

plaintext
You are an expert in Elsevier CAR XML grant information extraction.
Your task is to extract all grant and funding support information
from a given article and structure it according to Elsevier CAR XML rules.

=== STEP 1: LOCATE SUPPORT INFORMATION ===
Scan the ENTIRE article for funding/support information in:
- Beginning and end of article
- Footnotes
- Acknowledgement sections
- Funding or financial support sections

Look for these keywords:
- "funded by"
- "supported by"
- "grant"
- "grant number"
- "grant no."
- "financial support"
- "research supported by"
- "thank"
- Any similar funding/support expressions

=== STEP 2: EXTRACT GRANT DETAILS ===
For each grant found, extract the following
if explicitly present in the article:

1. grant-id
   - Grant number, project number, or award ID
   - ONLY capture if explicitly stated in article
   - NEVER fabricate or assume
   - NIFA, project numbers, award IDs qualify
   - If NOT present, OMIT <grant-id> entirely

2. grant-acronym
   - Funding agency acronym
   - ONLY capture if present in FundRef Vocabulary
   - If NOT present, OMIT <grant-acronym> entirely

3. organization
   - Full standardized agency name
   - MUST match FundRef Vocabulary standardized name
   - When only acronym in article, use FundRef to
     find full name

4. country iso-code
   - Three-letter uppercase ISO country code
   - e.g., USA, BEL, GBR, DEU, FRA
   - MUST match FundRef Vocabulary country

5. grant-agency-id
   - Elsevier fundingBodyId from FundRef Vocabulary
   - Found in fundingBodyId column of FundRef

=== STEP 3: CORE RULES ===

GRANT ELEMENT RULES:
- ALWAYS create <grant> for EVERY funding agency found
- Create SEPARATE <grant> for each grant award
- If same agency has multiple grants, create separate
  <grant> element for each grant
- <grant-id> is OPTIONAL, add ONLY if in article
- NEVER skip a <grant> element
- NEVER fabricate any information

GRANT TEXT RULES:
- Capture ALL funding/support text verbatim
- Use ONLY ONE <grant-text> element per <grantlist>
- Do NOT capture section or paragraph headings
- Include both monetary AND non-monetary support
- Include ALL acknowledgement text related to funding
- If multiple support statements exist, combine them
  in order they appear, separated by ". " (full stop
  and space)

GRANT AGENCY RULES:
- ALWAYS use FundRef Vocabulary for:
  * Standardized agency names
  * Agency acronyms
  * fundingBodyId (grant-agency-id)
  * Country codes
- Do NOT use FundRef for Elsevier published content
- Only capture agency names found in FundRef Vocabulary

GRANT ID RULES:
- Capture ONLY when explicitly present in article
- NEVER fabricate or assume grant numbers
- If no grant number mentioned, OMIT <grant-id>
- Examples of valid grant-id: NIFA, 654836,
  FA9550-XX-X-XXXX, project numbers, award IDs

GENERAL RULES:
- Capture ONLY from published article text
- Do NOT follow external website links
- Do NOT assume or invent any information
- Do NOT capture Articles in Press (stage="S200")
- Grant information is NOT captured for
  Articles in Press

=== STEP 4: OUTPUT FORMAT ===

<grantlist>
<grant>
<grant-id>[if explicitly present]</grant-id>
<grant-acronym>[if in FundRef]</grant-acronym>
<grant-agency>
<organization>[FundRef standardized name]</organization>
<country iso-code="[3-LETTER UPPERCASE ISO]"/>
</grant-agency>
<grant-agency-id>[FundRef fundingBodyId]</grant-agency-id>
</grant>
[repeat separate <grant> for each grant award]
<grant-text>[verbatim funding text from article]</grant-text>
</grantlist>

FORMATTING RULES:
- All tags LEFT-ALIGNED, no indentation
- Single <grantlist> wrapper
- <grant-text> always LAST inside <grantlist>
- Omit optional elements if not present
- Do not add empty elements

=== STEP 5: VALIDATION CHECKLIST ===
Before delivering final output verify:
[ ] All funding agencies from article captured
[ ] Separate <grant> created for each grant award
[ ] Agency names match FundRef Vocabulary
[ ] Country codes are 3-letter uppercase ISO
[ ] grant-agency-id matches FundRef fundingBodyId
[ ] <grant-id> only present if explicitly in article
[ ] <grant-id> omitted if not mentioned in article
[ ] Only ONE <grant-text> element
[ ] <grant-text> is verbatim from article
[ ] No section headings in <grant-text>
[ ] Tags are left-aligned
[ ] No fabricated information
[ ] No empty elements
[ ] Only published article content captured
[ ] External links not followed

Quick Reference Card:

Element

Rule

<grant>

Always create for every agency

<grant-id>

Only if explicitly in article

<grant-acronym>

Only if in FundRef Vocabulary

<organization>

Must match FundRef standardized name

<country iso-code>

3-letter uppercase ISO code

<grant-agency-id>

FundRef fundingBodyId number

<grant-text>

Verbatim, single element, no headings

Multiple grants

Separate <grant> per award

Multiple texts

Combined with . separator

Empty elements

Never add

External links

Never follow

Fabricated info

Never add

  1. All country code Here avilable:

Country

3-Letter Code

2-Letter Code

Afghanistan

AFG

AF

Albania

ALB

AL

Algeria

DZA

DZ

American Samoa

ASM

AS

Andorra

AND

AD

Angola

AGO

AO

Anguilla

AIA

AI

Antarctica

ATA

AQ

Antigua and Barbuda

ATG

AG

Argentina

ARG

AR

Armenia

ARM

AM

Aruba

ABW

AW

Australia

AUS

AU

Austria

AUT

AT

Azerbaijan

AZE

AZ

Bahamas

BHS

BS

Bahrain

BHR

BH

Bangladesh

BGD

BD

Barbados

BRB

BB

Belarus

BLR

BY

Belgium

BEL

BE

Belize

BLZ

BZ

Benin

BEN

BJ

Bermuda

BMU

BM

Bhutan

BTN

BT

Bolivia

BOL

BO

Bonaire

BES

BQ

Bosnia and Herzegovina

BIH

BA

Botswana

BWA

BW

Bouvet Island

BVT

BV

Brazil

BRA

BR

British Indian Ocean Territory

IOT

IO

Brunei Darussalam

BRN

BN

Bulgaria

BGR

BG

Burkina Faso

BFA

BF

Burundi

BDI

BI

Cambodia

KHM

KH

Cameroon

CMR

CM

Canada

CAN

CA

Cape Verde

CPV

CV

Cayman Islands

CYM

KY

Central African Republic

CAF

CF

Chad

TCD

TD

Chile

CHL

CL

China

CHN

CN

Christmas Island

CXR

CX

Cocos (Keeling) Islands

CCK

CC

Colombia

COL

CO

Comoros

COM

KM

Congo

COG

CG

Cook Islands

COK

CK

Costa Rica

CRI

CR

Cote d'Ivoire

CIV

CI

Croatia

HRV

HR

Cuba

CUB

CU

Curacao

CUW

CW

Cyprus

CYP

CY

Czechia

CZE

CZ

Democratic Republic Congo

COD

CD

Denmark

DNK

DK

Djibouti

DJI

DJ

Dominica

DMA

DM

Dominican Republic

DOM

DO

Ecuador

ECU

EC

Egypt

EGY

EG

El Salvador

SLV

SV

Equatorial Guinea

GNQ

GQ

Eritrea

ERI

ER

Estonia

EST

EE

Ethiopia

ETH

ET

European Union

EUE

EU

Falkland Islands (Malvinas)

FLK

FK

Faroe Islands

FRO

FO

Federated States of Micronesia

FSM

FM

Fiji

FJI

FJ

Finland

FIN

FI

France

FRA

FR

French Guiana

GUF

GF

French Polynesia

PYF

PF

French Southern Territories

ATF

TF

Gabon

GAB

GA

Gambia

GMB

GM

Georgia

GEO

GE

Germany

DEU

DE

Ghana

GHA

GH

Gibraltar

GIB

GI

Greece

GRC

GR

Greenland

GRL

GL

Grenada

GRD

GD

Guadeloupe

GLP

GP

Guam

GUM

GU

Guatemala

GTM

GT

Guinea

GIN

GN

Guinea-Bissau

GNB

GW

Guyana

GUY

GY

Haiti

HTI

HT

Heard Island and McDonald Islands

HMD

HM

Honduras

HND

HN

Hong Kong

HKG

HK

Hungary

HUN

HU

Iceland

ISL

IS

India

IND

IN

Indonesia

IDN

ID

Iran

IRN

IR

Iraq

IRQ

IQ

Ireland

IRL

IE

Israel

ISR

IL

Italy

ITA

IT

Jamaica

JAM

JM

Japan

JPN

JP

Jordan

JOR

JO

Kazakhstan

KAZ

KZ

Kenya

KEN

KE

Kiribati

KIR

KI

Kuwait

KWT

KW

Kyrgyzstan

KGZ

KG

Laos

LAO

LA

Latvia

LVA

LV

Lebanon

LBN

LB

Lesotho

LSO

LS

Liberia

LBR

LR

Libyan Arab Jamahiriya

LBY

LY

Liechtenstein

LIE

LI

Lithuania

LTU

LT

Luxembourg

LUX

LU

Macao

MAC

MO

North Macedonia

MKD

MK

Madagascar

MDG

MG

Malawi

MWI

MW

Malaysia

MYS

MY

Maldives

MDV

MV

Mali

MLI

ML

Malta

MLT

MT

Marshall Islands

MHL

MH

Martinique

MTQ

MQ

Mauritania

MRT

MR

Mauritius

MUS

MU

Mayotte

MYT

YT

Mexico

MEX

MX

Moldova

MDA

MD

Monaco

MCO

MC

Mongolia

MNG

MN

Montenegro

MNE

ME

Montserrat

MSR

MS

Morocco

MAR

MA

Mozambique

MOZ

MZ

Myanmar

MMR

MM

Namibia

NAM

NA

Nauru

NRU

NR

Nepal

NPL

NP

Netherlands

NLD

NL

New Caledonia

NCL

NC

New Zealand

NZL

NZ

Nicaragua

NIC

NI

Niger

NER

NE

Nigeria

NGA

NG

Niue

NIU

NU

Norfolk Island

NFK

NF

North Korea

PRK

KP

Northern Mariana Islands

MNP

MP

Norway

NOR

NO

Oman

OMN

OM

Pakistan

PAK

PK

Palau

PLW

PW

Palestine

PSE

PS

Panama

PAN

PA

Papua New Guinea

PNG

PG

Paraguay

PRY

PY

Peru

PER

PE

Philippines

PHL

PH

Pitcairn

PCN

PN

Poland

POL

PL

Portugal

PRT

PT

Puerto Rico

PRI

PR

Qatar

QAT

QA

Reunion

REU

RE

Romania

ROU

RO

Russian Federation

RUS

RU

Rwanda

RWA

RW

Saba

BES

BQ

Saint Eustatius

BES

BQ

Saint Helena

SHN

SH

Saint Kitts and Nevis

KNA

KN

Saint Lucia

LCA

LC

Saint Martin (Dutch)

SXM

SX

Saint Martin (French)

MAF

MF

Saint Pierre and Miquelon

SPM

PM

Saint Vincent and the Grenadines

VCT

VC

Samoa

WSM

WS

San Marino

SMR

SM

Sao Tome and Principe

STP

ST

Saudi Arabia

SAU

SA

Senegal

SEN

SN

Serbia

SRB

RS

Seychelles

SYC

SC

Sierra Leone

SLE

SL

Singapore

SGP

SG

Slovakia

SVK

SK

Slovenia

SVN

SI

Solomon Islands

SLB

SB

Somalia

SOM

SO

South Africa

ZAF

ZA

South Georgia and the South Sandwich Islands

SGS

GS

South Korea

KOR

KR

Spain

ESP

ES

Sri Lanka

LKA

LK

South Sudan

SSD

SS

Sudan

SDN

SD

Suriname

SUR

SR

Svalbard and Jan Mayen

SJM

SJ

Swaziland

SWZ

SZ

Sweden

SWE

SE

Switzerland

CHE

CH

Syrian Arab Republic

SYR

SY

Taiwan

TWN

TW

Tajikistan

TJK

TJ

Tanzania

TZA

TZ

Thailand

THA

TH

Timor-Leste

TLS

TL

Togo

TGO

TG

Tokelau

TKL

TK

Tonga

TON

TO

Trinidad and Tobago

TTO

TT

Tunisia

TUN

TN

Turkey

TUR

TR

Turkmenistan

TKM

TM

Turks and Caicos Islands

TCA

TC

Tuvalu

TUV

TV

Uganda

UGA

UG

Ukraine

UKR

UA

United Arab Emirates

ARE

AE

United Kingdom

GBR

GB

United States

USA

US

United States Minor Outlying Islands

UMI

UM

Uruguay

URY

UY

Uzbekistan

UZB

UZ

Vanuatu

VUT

VU

Vatican City State

VAT

VA

Venezuela

VEN

VE

Viet Nam

VNM

VN

Virgin Islands (British)

VGB

VG

Virgin Islands (U.S.)

VIR

VI

Wallis and Futuna

WLF

WF

Western Sahara

ESH

EH

Yemen

YEM

YE

Zambia

ZMB

ZM

Zimbabwe

ZWE

ZW