We introduce MoS (Mixture of States), a novel fusion paradigm for multimodal diffusion models that merges modalities using flexible, state-based interactions. The core of MoS is a learnable, token-wise router that creates denoising timestep- and input-dependent interactions between modalities' hidden states, precisely aligning token-level features with the diffusion trajectory. This router sparsely selects the top-k hidden states and is trained with an epsilon-greedy strategy, efficiently selecting contextual features with minimal learnable parameters and negligible computational overhead. We validate our design with text-to-image generation (MoS-Image) and editing (MoS-Editing), which achieve state-of-the-art results. With only 3B to 5B parameters, our models match or surpass counterparts up to 4x larger. These findings establish MoS as a flexible and compute-efficient paradigm for scaling multimodal diffusion models.
Accepted by CVPR 2026 | arXiv:2511.12207v1
Multimodal generation is typically built on a fixed attention regime: either causal decoding or fully bi-directional interaction. However, real multimodal reasoning is asymmetric and context-dependent. Different tokens may require different visibility patterns at different layers, especially when text instructions, visual context, and generation targets interact over a long denoising trajectory.
We introduce Mixture of States (MoS), a token-level routing framework that allows each token to dynamically switch between causal and bi-directional states. Instead of hand-crafted fusion rules, MoS learns these routing decisions end-to-end with a lightweight router, enabling the model to adapt token interactions to input content and generation stage.
Following this principle, the key characteristics of MoS can be summarized as follows:
Adaptive layer selection, enabled by token-wise routing. Rather than always consuming one fixed text layer (or enforcing rigid one-to-one layer matching), MoS lets each token query a pool of cross-modal hidden states and select useful ones through a learnable router. In practice, the router performs sparse top-k state selection, so layer usage is decided by content and context, not by a hard-coded layer index.
Dynamic, timestep-dependent conditioning through state routing. Diffusion denoising is non-stationary, but static one-shot text conditioning cannot reflect this evolution. MoS addresses this by making routing decisions conditioned on the current denoising state, so the selected conditional features vary across timesteps and noise levels. This keeps guidance aligned with what the model needs at each stage of generation.
Token-specific conditional signals at fine granularity. Different tokens require different evidence (semantic, structural, or style cues), so sharing one uniform layer embedding across all tokens is restrictive. MoS routes at token level: each token can draw from different states and layers, producing a more precise conditional interface between modalities and improving alignment in both generation and editing.
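The three properties above can be illustrated with a minimal sketch of token-wise sparse routing. This is not the paper's implementation: the function name `mos_route`, the dot-product scoring, and the tensor shapes are illustrative assumptions; only the top-k selection, the softmax over selected states, and the epsilon-greedy exploration mirror the described design.

```python
import numpy as np

def mos_route(query_tokens, layer_states, k=2, epsilon=0.0, rng=None):
    """Token-wise sparse top-k routing over a pool of cross-modal hidden states.

    query_tokens: (T, d) generation-tower token features
    layer_states: (L, T, d) hidden states from L understanding-tower layers
    Returns the routed states (T, d) and the (T, L) routing weights.
    """
    if rng is None:
        rng = np.random.default_rng(0)
    T, d = query_tokens.shape
    L = layer_states.shape[0]
    # Router logits: one score per (token, layer) pair, here a scaled dot product
    # (a stand-in for the paper's learnable router).
    logits = np.einsum('td,ltd->tl', query_tokens, layer_states) / np.sqrt(d)
    weights = np.zeros((T, L))
    for t in range(T):
        if rng.random() < epsilon:           # epsilon-greedy exploration (training only)
            top = rng.choice(L, size=k, replace=False)
        else:                                # greedy sparse top-k selection
            top = np.argsort(logits[t])[-k:]
        w = np.exp(logits[t, top] - logits[t, top].max())
        weights[t, top] = w / w.sum()        # softmax over the selected states only
    routed = np.einsum('tl,ltd->td', weights, layer_states)
    return routed, weights
```

Because the query features change with the denoising state, the same router yields different selections at different timesteps, which is the timestep-dependent behavior described above.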
Figure 1. MoS Design Details
Figure 2. Image Generation: Teaser and Prompt-conditioned Comparison.
Prompt-conditioned comparisons
MoS-Image is highlighted on the right.
A colorful poster with the title at the top in large letters: “Author Meet and Greet on Saturday.” Below the title is a portrait of the author in the center. At the bottom, smaller text reads “Book Signing and Q&A.”
On a large wooden table, a variety of foods are arranged in a vibrant display. In the center sits a pepperoni pizza, cut into eight slices, the golden crust slightly charred at the edges, melted cheese stretching between slices, and glossy red pepperoni discs glistening with oil. To the right, a hamburger is stacked high on a white plate: a sesame seed bun with a juicy beef patty, melted cheddar cheese dripping down the sides, layers of green lettuce, red tomato slices, and pickles visible in between, with golden French fries scattered beside it. On the left, a grilled fish is presented on a rectangular platter, its skin crispy and golden-brown with hints of char, garnished with lemon slices placed along its body and fresh parsley sprinkled across. Near the top of the table, a bowl of fruit overflows with color: shiny red apples, bright yellow bananas curving upward, deep purple grapes spilling over the edge, and a cut-open orange revealing its juicy segments. At the front of the scene, a small dessert plate holds a slice of chocolate cake, dark and rich with glossy frosting, topped with a bright red strawberry. The entire table is lit with soft natural light, creating highlights on the glossy fruit skins, reflections on the melted cheese, and warm shadows under the plates, giving the display a fresh and appetizing look.
A Chinese restaurant menu poster with a solid black background and golden decorative borders. At the top, in large bold letters, the heading says “Today’s Specials.” The appetizers section lists: “Spring Rolls - ¥18,” “Dumplings - ¥22,” “Hot and Sour Soup - ¥20.” The main dishes section displays in larger text: “Kung Pao Chicken - ¥45,” “Braised Beef - ¥55,” “Eggplant in Garlic Sauce - ¥38.” At the bottom, the desserts section reads: “Sesame Balls - ¥25,” “Mango Pudding - ¥28.” All menu items are written in clear white letters against the black background, with the prices shown directly beside each dish.
The image is an advertisement for a GPS tracking device designed for dogs, featuring a brown dog running in the woods and a close-up of the device. In the foreground, a brown dog and a yellow collar is prominently displayed, running on a dirt path surrounded by trees. To the right of the dog, a text overlay reads "LIVE GPS TRACKING" in black font within a yellow rectangle, followed by "NEVER HAVING TO HOPE SOMEONE SCANS THEIR MICROCHIP" in white font. This text highlights the key benefit of the product. In the bottom center of the image, a close-up view of the GPS tracking device is shown. The device is black with a yellow strap and features the letters "MoS" in white on its front. The strap is made of a textured material and has a black plastic buckle. The overall design of the device appears sleek and modern. The background of the image is a blurred forest scene, with trees and foliage visible behind the dog. The atmosphere is one of freedom and adventure, as the dog runs through the woods with ease.
The image is divided into two sections. The left side features a dark teal background with a grid pattern, accompanied by large white text that reads "Owner makes $131,150 IN 10 MONTHS WHEN PARTNERING WITH MoS." The word "MoS" is displayed in teal and yellow font below the main text. On the right side of the image, there is a photograph of a house situated in a wooded area. The house has a dark green exterior with white trim around the windows and doors. A small porch is visible at the front entrance, which is flanked by two lanterns on either side. The roof appears to be made of metal, and the surrounding landscape includes trees and bushes. A wooden walkway leads up to the front door, adding to the overall aesthetic appeal of the property.
Prompt-conditioned one-row generation comparison across five prompts, with MoS-Image highlighted on the right.
Figure 3. Instruction-based Editing: Teaser and Prompt-conditioned Comparison.
Prompt-conditioned editing
MoS-Editing is highlighted on the right.
remove 'Lover' text from the image, change the diffusion's color to light blue, add a word 'MoS' with purple under hand, and change the color of hand to gray
Make it a smiling face, add glasses, change to blue fur and green clothes.
The image depicts a person's hand holding a ring between their thumb and index finger. The hand has fair skin and is well-manicured. The ring is gold with pearls and diamonds, and is positioned in the center of the image. The background is a blurred beige or brown color, which helps to emphasize the ring and the hand. The overall tone of the image is elegant and sophisticated, with a focus on showcasing the ring in a luxurious and refined manner. The camera is positioned at a slightly elevated angle, capturing a close-up shot with a shallow depth of field, making the background blur and the ring sharp. The lighting is soft and gentle, with a subtle sheen on the hand and the ring. The image is of exceptionally high quality, with a clear and sharp focus on the ring and the hand.
Split the image into two sections, in the right section change the season to spring, in the left section change the season to winter, keep the original position of the tree
Prompt-conditioned one-row editing comparison across four cases, with MoS-Editing highlighted on the right.
To make the router interpretable, we visualize its behavior on the caption “A dog holding a sign that says ‘MoS in 2025’”. The first row shows the denoising trajectory, illustrating how the output evolves from pure noise to a coherent image as sampling proceeds.
We then summarize the router from two complementary views. First, we average routing weights across generation blocks and tokens to measure layer-wise importance at different denoising steps. Second, we fix a denoising step and inspect token-conditioned routing maps, which reveal how individual words trigger different connections between the understanding and generation towers.
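The two aggregation views can be sketched as follows, assuming routing weights are logged as a (steps, blocks, tokens, layers) array; the array name, sizes, and the Dirichlet-sampled stand-in data are hypothetical.

```python
import numpy as np

# Hypothetical logged routing weights: (denoising steps, generation blocks, tokens, layers).
# Each (step, block, token) slice is a distribution over understanding-tower layers.
rng = np.random.default_rng(0)
weights = rng.dirichlet(np.ones(12), size=(50, 24, 77))

# View 1: layer-wise importance per denoising step, averaged over blocks and tokens.
layer_importance = weights.mean(axis=(1, 2))   # (steps, layers)

# View 2: token-conditioned routing at a fixed step, averaged over blocks only.
step = 10
token_routing = weights[step].mean(axis=0)     # (tokens, layers)
```

View 1 produces the phase-dependence plot (how layer importance drifts over the trajectory); View 2 produces the per-word routing maps.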
Routing is phase-dependent: early steps are sparse and selective, while later steps shift toward smoother, more stable importance patterns.
Routing is token-specific: words such as “dog”, “holding”, and “sign” activate distinct layer combinations rather than sharing a single global pattern.
These qualitative patterns support the main design claim of MoS: useful multimodal fusion is neither uniform nor static, but depends on both denoising stage and token semantics.
Figure 4. Router Visualization.
Top: generation outputs at different denoising steps. Middle: layer-wise importance aggregated across generation blocks and tokens. Bottom: token-conditioned routing patterns showing that different words induce different preferences over understanding-tower layers.
To quantify the practical cost of MoS routing, we decompose end-to-end inference into three stages: input encoding, iterative generation, and output decoding. As shown below, the understanding tower takes about 0.094s, the generation tower takes about 0.121s, and decoding takes about 0.016s, while the MoS router contributes only around 0.007s. This shows that token-wise routing introduces a small latency overhead relative to the dominant generation path.
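Taking the reported per-stage timings at face value (and assuming they are measured on a common unit of work), the router's share of end-to-end latency works out to roughly 3%:

```python
# Stage timings reported above, in seconds.
stages = {"understanding": 0.094, "generation": 0.121, "decoding": 0.016, "router": 0.007}
total = sum(stages.values())
router_share = stages["router"] / total
print(f"total: {total:.3f}s, router share: {router_share:.1%}")
# → total: 0.238s, router share: 2.9%
```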
These results support our design goal: enabling dynamic, token-specific conditioning without sacrificing deployment efficiency under realistic sampling settings.
Figure 5. Router Efficiency. Runtime breakdown across input encoding, iterative generation, and output decoding. The router overhead is small compared with the main generation computation.
MoS replaces fixed multimodal fusion with a lightweight, token-wise router that dynamically selects hidden states across towers and denoising steps. This simple change makes the model substantially more adaptive without introducing heavy computational overhead.
Across image generation, instruction-based editing, router visualization, and efficiency analysis, the same picture emerges: effective multimodal interaction should be dynamic, phase-aware, and token-specific. MoS turns that principle into a practical design that improves quality and controllability while remaining efficient to deploy.
@inproceedings{liu2025mixture,
title={Mixture of States: Routing Token-Level Dynamics for Multimodal Generation},
author={Liu, Haozhe and Liu, Ding and Zhuge, Mingchen and Zhou, Zijian and Xie, Tian and He, Sen and Yang, Yukang and Liu, Shuming and Cong, Yuren and Guo, Jiadong and others},
booktitle={CVPR},
year={2026}
}