The organisation has also built a "general-purpose, multi-language political data-mining system" to search "massive datasets" for specific terms.
The "Syria Files" relate to emails dated August 2006 to March 2012, according to an announcement by WikiLeaks.
"This extraordinary data set derives from 680 Syria-related entities or domain names, including those of the ministries of presidential affairs, foreign affairs, finance, information, transport and culture."
WikiLeaks added it is "statistically confident that the vast majority of the data are what they purport to be."
The whistleblowers' website said that stories relating to the data will be published by WikiLeaks as well as a number of other publishers, including ARD in Germany, Associated Press in the US, L’Espresso in Italy and Owni in France.
In a press conference WikiLeaks outlined a new "data-mining system" which can be used to search for terms in documents, attachments and all file names within the attachments, and can also exclude certain terms.
Journalism.co.uk understands the new "search interface" will not be publicly available, but for use by WikiLeaks and journalists at partner news outlets.
Free daily newsletter
- Tip: How to get started with data journalism in a small newsroom
- Tool for journalists: DataProofer, for identifying errors in datasets
- How to map connections between individuals, companies and more with VIS
- Tip: Use these resources to file a Freedom of Information request in different countries
- Pen, paper and data: Orb's approach to global reporting