02348naa a2200277 a 450000100080000000500110000800800410001902400520006010000230011224501290013526000090026452014600027365000150173365000240174865000300177265300270180265300260182965300220185565300290187765300190190665300200192565300230194570000200196870000250198877300570201321638272024-04-23 2024 bl uuuu u00u1 u #d7 ahttps://doi.org/10.1007/s13278-024-01254-52DOI1 aNOGUEIRA, T. da S. aConstruction of a training dataset for a sentiment analysis model of dairy products tweets in Brazil.h[electronic resource] c2024 aCreating specific datasets for machine learning models is a frequent and challenging task, requiring considerable effort in sample collection and maintaining a balanced representation of each class. In this study, our objective was to create a training dataset for a sentiment analysis model by combining results obtained from 5 natural language processing tools through 3 distinct approaches, aiming to automatically label various tweets in the negative, neutral, and positive classes. Additionally, we applied data balancing techniques to assess different methods' impacts on the sentiment analysis models' ability to generalize classes to previously unseen samples. The results demonstrated that the three approaches used to combine tool results and apply balancing techniques provided significantly superior outcomes compared to models with imbalanced datasets. These advancements enabled sentiment analysis models to achieve greater precision and generalization capacity for novel samples. These findings underscore the importance of considering effective data balancing strategies when creating training datasets for machine learning applications, especially in tasks sensitive to class imbalance, such as sentiment analysis. This enhanced approach is crucial to improving the performance and applicability of sentiment analysis models in real-world scenarios, providing more precise data analyses that unveil valuable insights in digital marketing. aLaticĂ­nio aPesquisa de Mercado aProduto Derivado do Leite aArtifcial intelligence aBusiness intelligence aDigital marketing aInteligĂȘncia artificial aMarket digital aMarket research aSentiment analysis1 aSIQUEIRA, K. B.1 aGOLIATT, P. V. Z. C. tSocial Network Analysis and Mininggv. 14, 85, 2024.