We are looking for researchers or applied engineers with industry experience in the computer vision / visual-language modeling field (e.g. multimodal representation learning, visual diffusion models, visual encoders/decoders, etc.) We encourage the team to regularly publish, and the team works in a highly collaborative, research-driven environment, with full access to the Pinterest image-board-style graph for large-scale pre-training.
Pinterest's Advanced Technologies Group (ATG) is the in-house ML research division focused on building consumer AI products. The team works on projects involving LLMs, diffusion models, and other core models in the generative multimodal ML and computer vision space.