Inputs
Reference 1: User’s uploaded photo
Reference 2:
Jersey Number:
Jersey Team Name: (team of the jersey being held)
User Outfit:
Mood:
Prompt
Create a photorealistic image of the person from the user’s uploaded photo (Reference 1) standing next to the player from Reference 2 at pitchside, in front of the stadium stands, posing for a photo.
Location: Pitchside/touchline in a large stadium. Natural grass and advertising boards look realistic.
Stands: The background stands must feel 100% like the home crowd of the Reference 2 player’s team (single-team atmosphere). Dominant team colors, scarves, flags, and banners. No rival-team colors or mixed sections visible.
Composition: Both subjects centered, shoulder to shoulder. The player from Reference 2 can place one arm around the user.
Prop: They are holding a jersey together toward the camera. The back of the jersey must clearly show the name and the number given in Jersey Number. Print alignment is clean, sharp, and realistic.
Critical rule (lock the held jersey to a specific team)
The jersey they are holding must be an official kit design of the team given in Jersey Team Name.
Keep the jersey colors, patterns, and overall design consistent with that team’s kit.
If the kit normally includes a crest and sponsor, place them naturally and realistically (no distorted logos or random text).
Prevent color drift: the jersey’s primary and secondary colors must stay true to the known colors of the team given in Jersey Team Name.
Note: The team given in Jersey Team Name must not be the club the player from Reference 2 currently plays for.
Clothing:
Player from Reference 2: Wearing his current team’s match kit (shirt, shorts, socks), natural and accurate.
User: As described in User Outfit.
Camera: Eye level, 35mm, slight wide angle, natural depth of field. Focus on the two people, background slightly blurred.
Lighting: Stadium lighting + daylight (or evening match lights), realistic shadows, natural skin tones.
Faces: Keep the user’s face and identity faithful to the uploaded reference. The player from Reference 2 is clearly recognizable. Expression: as given in Mood.
Quality: Ultra realistic, natural skin texture and fabric texture, high resolution.
Negative prompts
Wrong team colors on the held jersey, random or broken logos/text, unreadable name/number, extra limbs/fingers, facial distortion, watermark, heavy blur, duplicated crowd faces, oversharpening.
Output
Single image, 3:2 landscape or 1:1 square, high resolution.