تعداد نشریات | 42 |
تعداد شمارهها | 1,514 |
تعداد مقالات | 12,490 |
تعداد مشاهده مقاله | 24,776,982 |
تعداد دریافت فایل اصل مقاله | 10,440,092 |
TFDF, not TF-IDF in Financial Analysis | ||
Journal of Computing and Security | ||
مقالات آماده انتشار، پذیرفته شده، انتشار آنلاین از تاریخ 30 مرداد 1402 | ||
نوع مقاله: Research Article | ||
شناسه دیجیتال (DOI): 10.22108/jcs.2023.137652.1128 | ||
نویسندگان | ||
mehran rezaei* 1؛ Meisam Hashemi2؛ Marjan Kaedi2 | ||
1university of isfahan | ||
2University of Isfahan | ||
چکیده | ||
Textual analysis in the realm of business depends on text processing techniques borrowed mainly from information retrieval. Yet, these text processing techniques are not viable in text based financial forecasting. In this paper, we suggest developing financial homegrown techniques for processing textual data, specifically in the course of scoring words where standard techniques are not appropriate in financial analysis. On that matter, we pursue two issues. First, we examine major information retrieval heuristics, where we find TF-IDF too facile not only in predicting trends but also in generating accurate results (in terms of errors) on large numbers in text based financial analysis. Second, we work on a new heuristic satisfying financial concerns. We consider the relationship between the publication rate of information and its importance. The proposed heuristic provides results of unmatchable performance in both predicting trends and precision measures. In an additional analysis, we optimize our scheme using a genetic algorithm as an optimization technique and get greater precision. In comparison with TF-IDF, our proposed heuristic conduces to a 38.5 percent lower error in closeness measures which is again reduced by 16.46 percent with the help of a genetic algorithm. Our findings suggest that researchers in the field of financial textual analysis should not rely on standard information retrieval heuristics. | ||
کلیدواژهها | ||
Financial textual analysis؛ Term weighting؛ Genetic Algorithm؛ Stock | ||
آمار تعداد مشاهده مقاله: 15 |